ABSTRACT

Frequency analysis of hydrologic data is a knowledge domain where a lot of decisions have to be taken based on human expertise, intuition and subjective judgement. Frequency analysis, being data dependent, is subject to many errors. Moreover, there is no fixed distribution that can be fitted to a given set of data. An earlier study dealt with the structure and development of an ES based computer program, FACHVES. The Expert System is embedded in the FORTRAN program. The program is written in the FORTRAN language and the expert system has been developed for the VAX-VMS environment. This study deals with modification and enhancement of the capabilities of the earlier version.

In the present version, a number of subroutines have been added which help in a more detailed analysis of the data set. For example, the subroutine OUTLIER identifies the number and the values of the outliers present in the data, and the subroutine CONFBND is a plotting subroutine which plots the fitted distribution, the sample data and the confidence bands. Other subroutines include FREQPL and TPLOT for plotting, MLS for the method of least squares and GOFITST for comparing theoretical and actual Chi square values in the goodness of fit test. In the given program

i. a distribution can be fitted to the data after taking Expert System advice,

ii. the given set of data can be tested for outliers,

iii. if outliers are present, parameters can be re-estimated by MLS,

iv. the goodness of fit can be tested by the Chi square test,

v. the fitted distributions can be visualised graphically by seeing the plots and

vi. which distribution is more appropriate to the given set of data may then be decided.

Hence, the modifications done have enhanced the capabilities in fitting a distribution.

The developed package of FACHVES Ver 2.0 has been tested with three sets of data representing recorded data at two different sites in India. The results of this study verify the performance of the package, in particular the enhancements of outlier test and graphic output.
INTRODUCTION


1.1 General

In this age of computers, engineering design and analysis have attained greater heights. Fast and efficient computing has made several complex analytical techniques, e.g., numerical analysis, statistical analysis, etc., easy. The programs for engineering problems written in higher level languages like FORTRAN, BASIC, C and PASCAL were able to solve complex problems involving a number of ordinary and partial differential equations and logical constraints, or analyse problems involving uncertainties. Development of languages like LISP and PROLOG, in which logical approaches were easier to program, facilitated the design of systems which could tackle problems that depended upon the intuition and subjective judgement of the analyst and could not be solved by routine analytic techniques. This led to the development of domain specific systems after the realisation that creating universally wise systems was not possible. Subsequently, it was realised that while expertise may be different from problem area to problem area, the structure of the problem may be the same, and so several Expert System shells were developed. It is generally possible to develop and implement Expert Systems in a problem area according to the grammar of the shell used.

Water resources engineering deals with many processes which vary in space, in time and also probabilistically. Many a time decisions are to be taken on the basis of past experience and intuition. The data available for analysis are also subject to many observational or computational errors. So problem solving techniques in this field of engineering design also need a system which can incorporate the domain specific expertise with computational techniques.

Thus, in computer aided analysis and design in engineering, the numerical, analytical and logical capabilities of a digital computer are extended with the heuristic capabilities of artificial intelligence and expert systems in efficiently solving problems in the areas of analysis and design.


1.2 ES in Water Resources Engineering

Artificial Intelligence (AI) is that part of computer science concerned with designing intelligent computer systems, i.e., systems that exhibit the intelligent characteristics of human beings like understanding language, learning, reasoning, solving problems and so on. AI is a way to impart intelligence to a computer (Levine, 198?).

The field of AI encompasses robotics, game playing, the automated translation of language and Expert Systems (ES). ES is concerned with the development of computer software that can partially represent human knowledge and utilise that knowledge to solve complex problems within a specific domain (Johnston, 198b).

Water resources engineering deals with planned development and management of water, the spatially and stochastically varying natural resource above and below ground. A large number of uncertainties, which can arise due to hydrology, the hierarchical and multistage nature of decision making, financial and economic variabilities, implementation techniques, social constraints and changing national conditions, are involved in designing a water resource system, viz., planning, development, design and management. So, a large amount of domain dependent expertise is used in planning, design, construction and integrated operation of water resource systems, and this expertise is often heuristic in nature.

ES are being developed in areas of water resources like database management, information systems and water quality management (Smolior, 1985; Datta and Peralta, 1986; Arnold et al., 1989; Simonovic, 1989; Datta et al., 1990), selection of design data such as storm (Nielson, 1986) or flood estimation (Fayegh and Russel, 1986), and appropriate treatment technology for water supply/sewage (Arnold, 1986). ES has also been applied for data analysis (Wilson, 1986), hydrologic modelling and parameter estimation (Engman et al., 1986; Delluer, 198?), tank irrigation systems (Oswald, 1989), and choice of model to be used and preparation of input data for a reservoir system (Lindberg and Nielson, 1986). Also, management of multipurpose systems of reservoirs integrated with conjunctive use of surface water and ground water are complex problems which need human expertise, common sense and heuristics. ES like SERPES for the sewerage rehabilitation planning process and WADNES for handling emergencies in a water supply network (Ahmad et al., 1989) have been successfully implemented.

The concept of combining the decision making capabilities of an expert decision support system and a Geographical Information System (Intelligent GIS) was proposed by Arnold et al. (1989). The primary purpose of such a system is to incorporate heuristic knowledge regarding measurement and other uncertainties in the estimation of hydrologic or water quality variables. These applications demonstrate the possibilities of ES applications to water resource engineering.

1.3 Objectives of the Study

Frequency analysis of hydrologic data is a preliminary step in the design of hydraulic structures, control of extreme hydrologic events, management of water resources, etc. This field is suitable for ES application as it deals with data lacking in quality and quantity, having observational and computational errors, and sparseness. Moreover, there is no universally acceptable methodology for fitting a probability distribution to a given set of data and, if at all, only a limited knowledge about the parent distribution of the samples may be available. Physical processes resulting in a high or low precipitation or stream flow are also not well understood. Often the presence of outliers, measurement errors, or the occurrence of rare combinations of physical processes or multiple distributions makes the problem more complicated. Hence an efficient approach to frequency analysis requires incorporation of statistical inference tools as well as the heuristic knowledge of knowledgeable human experts in this field. A framework has been developed for frequency analysis of hydrologic data (FACHVES Ver 2.0) and is available for the study. The major objectives of the study are as follows:

1. to include graphic capabilities and thus enhance the process of fitting an appropriate distribution by visual observation of the data relationships,

2. to test for outliers in different distributions and analyse them, and

3. to incorporate them in an Expert System available for the study (FACHVES).

1.4 Scope of Study

A variety of computer programs in the FORTRAN language are available at IIT Kanpur for fitting one or more probability distributions. A number of computing environments like the HP 9000 series with a UNIX operating system, a MicroVAX II computer system with a VMS-VAX operating system and IBM PCs are available for the study. The scope of the study was limited to

1. enhancement of FACHVES, a FORTRAN based program with an ES tool (CLIPS, as identified in Sec 3.1.4), in a MicroVAX-VMS environment,

2. graphics facilities on the text screen itself and

3. a limited number of outlier tests.

1.5 Organisation of the Study

The study is reported in the following sequence:

1. Introduction to Expert Systems, applications in water resources engineering, and objectives, scope and organisation of the study (Chapter 1).

2. An outline of frequency analysis, e.g., about different probability distributions, identification of distributions, estimation of parameters, identification and tests for outliers for different distributions, confidence bands, and tests for goodness of fit (Chapter 2).

3. A brief introduction to expert systems, building expert systems, CLIPS as an ES shell and its application, and the salient features of FACHVES (Frequency Analysis for Continuous Hydrologic Variables with embedded Expert System) which has already been developed (Chapter 3).

4. The modifications to the FACHVES program, structure and introduction to the use of FACHVES Ver 2.0 (Chapter 4).

5. Data used, the results, discussions and conclusions for FACHVES Ver 2.0 (Chapter 5) and

6. Summary, conclusions and suggestions (Chapter 6).
2.1 Introduction

Hydrologic variables like evaporation, precipitation, streamflow, etc. vary seasonally and also in a probabilistic manner. The random variation of the variables can be represented in terms of the frequency distribution of the variables. The fitting of a probability distribution to a random characteristic or variable, based on the limited data available concerning the variable, is an important area of study in Hydrology. Hydrologic systems are also affected by extremes of random variables such as severe storms, floods and droughts. The magnitude of a random event is inversely related to its frequency of occurrence, very severe events occurring less frequently than more moderate events. The objective of frequency analysis of hydrologic data is to relate the magnitude of random variables to their frequency of occurrence through the use of probability distributions. The hydrologic data analysed are generally assumed to be for an independent and identically distributed variable, and the hydrologic system producing them (e.g., a storm rainfall system) is considered to be stochastic, space-independent and time-independent. The hydrologic data employed should be carefully selected so that the assumptions of independence and identical distribution are satisfied.

The results of frequency analysis can be used for many engineering purposes, viz., for the design of dams, bridges, culverts and flood control structures; to determine the economic value of conservation and flood control projects; and to delineate flood plains and determine the effect of encroachments on the flood plain.

2.2 Probability functions

The relative frequency function $f_s(x)$ may be defined as the ratio of the number of observations $n_i$ in interval $i$ (the feasible range of the random variable is divided into discrete intervals) to the total number of observations $n$, which is calculated from sample data:

$$f_s(x_i) = \frac{n_i}{n} \tag{2.1}$$

The sum of the values of the relative frequencies up to a given point is the cumulative frequency function $F_s(x)$, which is also calculated from the sample data:

$$F_s(x_i) = \sum_{j=1}^{i} f_s(x_j) \tag{2.2}$$

The corresponding functions for the population are approached as limits as $n \to \infty$ and $\Delta x \to 0$ ($\Delta x$ is the class interval). In the limit, the relative frequency function divided by the interval length $\Delta x$ becomes the probability density function $f(x)$,

$$f(x) = \lim_{\substack{n \to \infty \\ \Delta x \to 0}} \frac{F_s(x + \Delta x) - F_s(x)}{\Delta x} \tag{2.3}$$

and the cumulative frequency function becomes the probability distribution function $F(x)$,

$$F(x) = \lim_{\substack{n \to \infty \\ \Delta x \to 0}} F_s(x) \tag{2.4}$$

Since only the sample data are available, the distributions and the density functions are expressed in terms of parameters of the sample, which are assumed to be equal to the population values.
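The relative and cumulative frequency functions can be computed directly from a sample. The following is a minimal sketch in Python, not the FACHVES implementation; the function name `relative_frequencies`, the equal-width intervals and the sample values are assumptions made only for illustration.

```python
from itertools import accumulate


def relative_frequencies(data, lower, width, n_bins):
    """Return (f_s, F_s) for equal-width intervals starting at `lower`.

    f_s[i] is the fraction of observations falling in interval i and
    F_s[i] is the running sum of f_s up to and including interval i.
    """
    counts = [0] * n_bins
    for x in data:
        i = int((x - lower) // width)
        if 0 <= i < n_bins:  # observations outside the range are ignored
            counts[i] += 1
    n = len(data)
    f_s = [c / n for c in counts]
    F_s = list(accumulate(f_s))
    return f_s, F_s


# Hypothetical sample, four intervals of width 10 starting at 10.
sample = [12.0, 18.5, 22.1, 25.0, 31.4, 33.3, 38.9, 47.2]
f_s, F_s = relative_frequencies(sample, lower=10.0, width=10.0, n_bins=4)
print(f_s)
print(F_s)
```

Dividing each `f_s[i]` by the interval width gives a histogram estimate of the density, which is the finite-sample counterpart of the limit that defines $f(x)$.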
2.3 Parameters of a distribution

The functional form of the distribution is expressed in terms of the parameters of the distribution. The statistics of any distribution are the mean, median, mode, standard deviation, range, coefficient of variation, skewness and kurtosis. The statistics are determined as follows.

Arithmetic mean of a set of observations is their sum divided by the number of observations, e.g., the arithmetic mean $\bar{x}$ of $n$ observations $x_1, x_2, \ldots, x_n$ is given by

$$\bar{x} = \frac{x_1 + x_2 + \cdots + x_n}{n} = \frac{1}{n} \sum_{i=1}^{n} x_i \tag{2.5}$$

Median of a distribution is the value of the variable which divides the distribution into two equal parts such that the number of observations above it is equal to the number of observations below it. Thus the median is a positional average. In case of ungrouped data, if the number of observations is odd then the median is the middle value after the values have been arranged in ascending or descending order of magnitude. In case of an even number of observations, there are two middle terms and the median is obtained by taking the arithmetic mean of the middle terms.

Mode is the value which occurs most frequently in a set of observations and around which the other items of the set cluster densely. So the mode is the value of x corresponding to the maximum frequency.

The standard deviation is the positive square root of the arithmetic mean of the squares of the deviations of the given values from their arithmetic mean. For the frequency distribution whose sample values are $x_i$ with a frequency of $f_i$, for $i = 1, 2, \ldots, n$, the sample standard deviation is

$$s = \sqrt{\frac{\sum_{i} f_i (x_i - \bar{x})^2}{N - 1}} \tag{2.6}$$

where $\bar{x}$ is the arithmetic mean of the distribution and $\sum_{i} f_i = N$. The square of the standard deviation is called the variance.

Range is the difference between the two extreme observations of the distribution. Range is the simplest but a crude measure of dispersion of the values of the variable.

Coefficient of variation (CV) is the percentage variation about the mean, the standard deviation being considered as the variation parameter with respect to the mean:

$$CV\,\% = 100\,\frac{\sigma}{\mu} \tag{2.7}$$

The series with the greater CV is said to be more variable than the other. The series having the lesser CV is said to be more consistent than the other.

Skewness is studied to have an idea about the shape of the curve that can be drawn with the help of the given data. A distribution is said to be skewed if

i) the mean, median and mode fall at different points or

ii) the pdf curve drawn for a given set of data is not symmetrical but stretched more to one side than to the other.

A sample estimate of the coefficient of skewness is

$$C_s = \frac{n \sum_{i=1}^{n} (x_i - \bar{x})^3}{(n-1)(n-2)\,s^3} \tag{2.8}$$

The skewness is positive if the larger tail of the distribution lies towards the higher values of the variate (the right) and is negative in the contrary case (Fig 2.1).

Kurtosis gives an idea about the convexity of the curve. It indicates the flatness or peakedness of the curve.
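The statistics defined in this section can be evaluated together for a sample. The sketch below is illustrative only and assumes ungrouped data (every frequency equal to one), the divisor n - 1 for the standard deviation, and a hypothetical function name `sample_statistics`; it is not taken from the FACHVES program.

```python
import math
from statistics import mean, median


def sample_statistics(x):
    """Mean, median, standard deviation, range, CV (%) and skewness."""
    n = len(x)
    xbar = mean(x)
    # Sample standard deviation with divisor (n - 1).
    s = math.sqrt(sum((v - xbar) ** 2 for v in x) / (n - 1))
    # Coefficient of variation: standard deviation as a percentage of the mean.
    cv = 100.0 * s / xbar
    # Sample coefficient of skewness.
    cs = n * sum((v - xbar) ** 3 for v in x) / ((n - 1) * (n - 2) * s ** 3)
    return {
        "mean": xbar,
        "median": median(x),
        "std": s,
        "range": max(x) - min(x),
        "cv_percent": cv,
        "skew": cs,
    }


# Hypothetical sample with a long right tail (positive skewness).
stats = sample_statistics([2.0, 4.0, 4.0, 5.0, 10.0])
print(stats)
```

For this sample the mean (5.0) exceeds the median (4.0) and the skewness is positive, which matches the description of a distribution whose larger tail lies towards the higher values.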

2.4 Probability distributions

Some of the distributions commonly used in hydrology and their density functions are discussed below.

2.4.1 Normal and Log Normal distributions

A random variable X is said to have a Normal distribution with parameters $\mu$ (mean) and $\sigma^2$ (variance) (Sec. 2.3) if its density function is given by

$$f(x; \mu, \sigma) = \frac{1}{\sigma\sqrt{2\pi}} \exp\left[-\frac{1}{2}\left(\frac{x - \mu}{\sigma}\right)^2\right], \tag{2.9}$$

$$-\infty < x < \infty, \quad -\infty < \mu < \infty, \quad \sigma > 0$$

The Normal probability curve is bell shaped (Fig. 2.2) and symmetrical about the line $x = \mu$. For large values of $\sigma$, the curve tends to flatten out and for small values of $\sigma$, it has a sharp peak. The mean, median and mode of the distribution coincide. The standard Normal variate is given by

$$Z = \frac{X - \mu}{\sigma} \tag{2.10}$$

with $E(Z) = 0$ and $\mathrm{var}(Z) = 1$.

Fig 2.2 Normal probability curve
Hydrologic variables such as annual precipitation are often the sum of many independent events and they tend to follow the normal distribution. The main limitations of the normal distribution for describing hydrologic variables are that it varies over a continuous range $(-\infty, \infty)$ while most hydrologic variables are non-negative, and that it is symmetric while hydrologic data tend to be skewed.

Consider the exceedance probability, viz., the probability $P(X \ge x)$:

$$P(X \ge x) = 1 - P(X < x) = 1 - F(x) \tag{2.11}$$

Then the recurrence interval, $T$, which is the average interval between successive equalling or exceedance of $x$, is given by

$$T = \frac{1}{P} = \frac{1}{1 - F(x)} \tag{2.12}$$

The value of $Z$ corresponding to an exceedance probability of $p$ ($p = 1/T$) can be calculated for the Normal distribution by finding the value of an intermediate variable $w$,

$$w = \left[\ln\left(\frac{1}{p^2}\right)\right]^{1/2} \quad (0 < p \le 0.5) \tag{2.13}$$

and then calculating $Z$ using the approximation

$$Z = w - \frac{2.515517 + 0.802853w + 0.010328w^2}{1 + 1.432788w + 0.189269w^2 + 0.001308w^3} \tag{2.14}$$

When $p > 0.5$, $1 - p$ is substituted for $p$ in the above equation and then the value of $Z$ computed is given a negative sign (Abramowitz and Stegun, 1965).
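The two-step calculation of the standard normal variate from an exceedance probability can be sketched as follows. This is an illustrative Python version of the rational approximation described above, with the coefficients of Abramowitz and Stegun; the function names `normal_variate` and `normal_variate_for_return_period` are assumptions and do not refer to FACHVES subroutines.

```python
import math


def normal_variate(p):
    """Approximate standard normal variate z with exceedance probability p."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    # For p > 0.5 work with 1 - p and reverse the sign at the end.
    q = p if p <= 0.5 else 1.0 - p
    w = math.sqrt(math.log(1.0 / q ** 2))
    z = w - (2.515517 + 0.802853 * w + 0.010328 * w ** 2) / (
        1.0 + 1.432788 * w + 0.189269 * w ** 2 + 0.001308 * w ** 3
    )
    return z if p <= 0.5 else -z


def normal_variate_for_return_period(T):
    """Standard normal variate for a return period T (p = 1/T)."""
    return normal_variate(1.0 / T)


print(normal_variate_for_return_period(100.0))
```

The approximation is quoted in the literature with an absolute error of less than about 0.00045, so it is adequate for frequency factors but should not be treated as an exact inverse of the Normal distribution function.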

If the random variable $Y = \log X$ is normally distributed then $X$ is said to be lognormally distributed. Chow (1954) reasoned that this distribution is applicable to hydrologic variables formed as the products of other variables since if $X = X_1 X_2 X_3 \cdots X_n$, then

$$Y = \log X = \sum_{i=1}^{n} \log X_i = \sum_{i=1}^{n} Y_i \tag{2.15}$$

which tends to the normal distribution for large $n$ provided that the $X_i$ are independent and identically distributed. The log-normal distribution has been found to describe the distribution of hydraulic conductivity in a porous medium, the distribution of rain drop sizes in a storm, etc. The log normal distribution is bounded ($x > 0$) and the logarithmic transformation tends to reduce the positive skewness commonly found in hydrologic data. Some limitations are that it has only two parameters and it requires the logarithms of the data to be symmetric about their mean.

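Let $\mu$ and $\sigma$ be the mean and standard deviation of the log normally distributed variable. The mean $\mu_{(n)}$ and the standard deviation $\sigma_{(n)}$ of the normalised variable are related to $\mu$ and $\sigma$ by

$$\mu_{(n)} = \frac{1}{2} \ln\left\{\frac{\mu^4}{\mu^2 + \sigma^2}\right\} \quad \text{and} \tag{2.16}$$

$$\sigma_{(n)} = \sqrt{\ln\left\{\frac{\sigma^2 + \mu^2}{\mu^2}\right\}} \tag{2.17}$$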
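The conversion between the moments of the original variable and those of its natural logarithm can be checked numerically. The sketch below assumes natural logarithms and the standard lognormal moment relations; the function names are hypothetical.

```python
import math


def log_space_moments(mu, sigma):
    """Mean and standard deviation of ln X, given the mean and standard
    deviation (mu, sigma) of a lognormally distributed variable X."""
    mu_n = 0.5 * math.log(mu ** 4 / (mu ** 2 + sigma ** 2))
    sigma_n = math.sqrt(math.log((sigma ** 2 + mu ** 2) / mu ** 2))
    return mu_n, sigma_n


def real_space_moments(mu_n, sigma_n):
    """Inverse relation: mean and standard deviation of X from those of ln X."""
    mu = math.exp(mu_n + 0.5 * sigma_n ** 2)
    sigma = mu * math.sqrt(math.exp(sigma_n ** 2) - 1.0)
    return mu, sigma


# Hypothetical values: mean 100 and standard deviation 50.
mu_n, sigma_n = log_space_moments(100.0, 50.0)
print(mu_n, sigma_n)
print(real_space_moments(mu_n, sigma_n))
```

Applying the inverse relation to the result recovers the original mean and standard deviation, which is a convenient consistency check on the two formulas.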


2.4.2 Pearson and Log Pearson distributions

Pearson Type 3 distribution, also called the Three parameter Gamma distribution, introduces a third parameter, the lower bound $\epsilon$, so that by the method of moments, the three sample moments (the mean, the standard deviation and the coefficient of skewness) can be transformed into the three parameters $\lambda$, $\beta$ and $\epsilon$ of the probability distribution.

The Pearson system of distributions includes seven types. They are all solutions for $f(x)$ in an equation of the form

$$\frac{d[f(x)]}{dx} = \frac{f(x)\,(x - d_0)}{C_0 + C_1 x + C_2 x^2} \tag{2.18}$$

where $d_0$ is the mode of the distribution (the value of $x$ for which $f(x)$ is a maximum) and $C_0$, $C_1$ and $C_2$ are coefficients to be determined.

When $C_2 = 0$, the solution of Eq. 2.18 is a Pearson Type 3 distribution, which has three parameters and a probability density function given by

$$f(x) = \frac{\lambda^{\beta} (x - \epsilon)^{\beta - 1} e^{-\lambda (x - \epsilon)}}{\Gamma(\beta)}, \quad x \ge \epsilon \tag{2.19}$$

where

$$\beta = \left(\frac{2}{C_s}\right)^2, \quad \lambda = \frac{\sqrt{\beta}}{\sigma_x}, \quad \epsilon = \bar{x} - \sigma_x \sqrt{\beta},$$

$C_s$ = coefficient of skewness and
$\sigma_x$ = standard deviation of $x$.

The third parameter is the lower bound $\epsilon$. For $C_1 = C_2 = 0$, the Normal distribution is the solution of Eq. 2.18. Thus the Normal distribution is a special case of the Pearson Type 3 distribution, describing a nonskewed variable. The Pearson Type 3 distribution was first applied in hydrology by Foster (1924) to describe the probability distribution of the annual maximum flood peaks.

When the data are very positively skewed, a log transformation is used to reduce the skewness. If $\log X$ follows a Pearson Type 3 distribution, then $X$ is said to follow a Log Pearson Type 3 distribution. As a special case, when $\log X$ is symmetric about its mean, the Log Pearson Type 3 distribution reduces to the Log Normal distribution.

The location of the bound in the Log Pearson Type 3 distribution depends on the skewness of the data. If the data are positively skewed, then $\log X \ge \epsilon$ and $\epsilon$ is a lower bound, while if the data are negatively skewed, $\log X \le \epsilon$ and $\epsilon$ is an upper bound. The log transformation reduces the skewness of the transformed data and may produce transformed data which are negatively skewed from original data which are positively skewed. In that case, the application of the Log Pearson Type 3 distribution would impose an artificial upper bound on the data.

The frequency factor for the Pearson Type 3 distribution depends on the return period $T$ and the coefficient of skewness $C_s$. When $C_s = 0$, the frequency factor is equal to the standard normal variate $Z$. When $C_s \ne 0$, $K_T$ is approximated by Kite (1977) as

$$K_T = Z + (Z^2 - 1)k + \frac{1}{3}(Z^3 - 6Z)k^2 - (Z^2 - 1)k^3 + Zk^4 + \frac{1}{3}k^5 \tag{2.20}$$

where $k = C_s/6$.
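The Kite (1977) approximation above is a polynomial in k and is simple to evaluate. The following sketch is an illustration in Python with a hypothetical function name; the standard normal variate z is taken as an input and would normally come from the return period.

```python
def pearson3_frequency_factor(z, cs):
    """Approximate Pearson Type 3 frequency factor.

    z  : standard normal variate for the chosen return period
    cs : coefficient of skewness of the data (or of their logarithms
         when the Log Pearson Type 3 distribution is used)
    """
    k = cs / 6.0
    return (
        z
        + (z ** 2 - 1.0) * k
        + (z ** 3 - 6.0 * z) * k ** 2 / 3.0
        - (z ** 2 - 1.0) * k ** 3
        + z * k ** 4
        + k ** 5 / 3.0
    )


# With zero skewness the factor reduces to the normal variate itself.
print(pearson3_frequency_factor(2.326, 0.0))
# Positive skewness raises the factor for a rare (high) event.
print(pearson3_frequency_factor(2.326, 1.0))
```

The estimated event magnitude then follows from the mean plus the frequency factor times the standard deviation, applied to the logarithms in the Log Pearson case.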

2.4.3 Extreme value distributions 

Extreme values are selected maximum or minimum values of sets of data. For example, the annual maximum discharge at a given location is the largest recorded discharge value during a year, and the annual maximum discharge values for each year of the historical record make up a set of extreme values that can be analysed statistically. When the number of selected extreme values is large, distributions of the extreme values selected from sets of samples of any probability distribution have been shown by Fisher and Tippett (1928) to converge to one of three forms of Extreme value distributions, called Types I, II and III respectively.

The three limiting forms were shown by Jenkinson (1955) to be special cases of a single distribution called the General Extreme Value (GEV) distribution. The probability distribution function for the GEV is

$$F(x) = \exp\left[-\left(1 - k\,\frac{x - u}{\alpha}\right)^{1/k}\right] \tag{2.21}$$

where $k$, $u$ and $\alpha$ are parameters to be determined.

For $k = 0$, the distribution is the Extreme value Type 1 distribution whose probability density function is

$$f(x) = \frac{1}{\alpha} \exp\left[-\frac{x - u}{\alpha} - \exp\left(-\frac{x - u}{\alpha}\right)\right], \quad -\infty < x < \infty \tag{2.22}$$

where $\alpha = \sqrt{6}\,\sigma/\pi$ and $u = \mu - 0.5772\,\alpha$.

For the Extreme value Type 1 distribution, $x$ is unbounded. For $k > 0$, the distribution is the Extreme value Type 3 distribution for which the GEV equation applies for $-\infty < x \le (u + \alpha/k)$. In both the cases, $\alpha$ is assumed positive. For the Extreme value Type 3 distribution, $x$ is bounded from above (by $u + \alpha/k$).
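As an illustration of the Extreme value Type 1 case, the sketch below estimates the two parameters from a mean and standard deviation and evaluates the density and the distribution function. It assumes the standard double exponential form F(x) = exp(-exp(-(x - u)/alpha)), which is the limit of the GEV function as k tends to zero; the function names and the numerical values are hypothetical.

```python
import math


def ev1_parameters(mean, std):
    """Scale alpha and location u of the Extreme value Type 1 distribution
    from the mean and standard deviation (method of moments)."""
    alpha = math.sqrt(6.0) * std / math.pi
    u = mean - 0.5772 * alpha
    return u, alpha


def ev1_pdf(x, u, alpha):
    """Probability density function of the Extreme value Type 1 distribution."""
    y = (x - u) / alpha
    return math.exp(-y - math.exp(-y)) / alpha


def ev1_cdf(x, u, alpha):
    """Probability distribution function (limit of the GEV form as k -> 0)."""
    return math.exp(-math.exp(-(x - u) / alpha))


# Hypothetical annual maximum series with mean 100 and standard deviation 30.
u, alpha = ev1_parameters(100.0, 30.0)
print(u, alpha)
print(ev1_cdf(u, u, alpha))  # value of F at the location parameter
```

At the location parameter the distribution function equals exp(-1), so u is the mode of the distribution rather than its mean.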

2.5 Estimation of parameters

The parameters may be estimated generally by one of the three following methods (Chow, 1964; Yevjevich, 1972).

2.5.1 Method of Moments (MM)

Method of Moments relates the sample values of the moments to the parameters of the distribution. The $r$th sample moment about any arbitrary $x_0$ is given by

$$m_r = \frac{1}{N} \sum_{i=1}^{N} (x_i - x_0)^r \tag{2.23}$$

where $N$ is the size of the sample, which is an approximation to the $r$th population moment,

$$\mu_r = \int_{-\infty}^{\infty} (x - x_0)^r f(x)\,dx \tag{2.24}$$

For any distribution, the first moment about the origin gives the sample mean and the second moment about the mean gives the sample variance, viz.,

$$\bar{x} = \frac{1}{N} \sum_{i=1}^{N} x_i, \tag{2.25}$$

$$\hat{\sigma}^2 = \text{sample var}(x) \quad \text{and} \tag{2.26}$$

$$\hat{\sigma}^2 = \frac{1}{N - 1} \sum_{i=1}^{N} (x_i - \bar{x})^2 \tag{2.27}$$

$\bar{x}$ and $\mathrm{var}(x)$ are the sample estimates of $\mu$ and $\sigma^2$ for the theoretical distribution.

Estimation by the method of moments is asymptotically efficient and the efficiency is usually smaller than unity. For small samples, the estimates are significantly affected by extreme or off control points that may be present in the sample.
2.5.2 Method of Least Squares (MLS)

This method consists in the estimation of the parameters of the assumed distribution by minimising the sum of squares of deviations of the observed points from the fitted function. Chow (1964) suggests the following procedure.

If $\bar{x}$ and $s_x$ are the sample mean and standard deviation, the frequency factor $K$ is defined as $K = (x - \bar{x})/s_x$. For any given distribution, $K$ can be related to the cumulative distribution $F(x)$ of the distribution. For a given $x_i$, $F(x_i)$ is estimated from plotting position formulae. For the assumed distribution, $K_i$ is obtained from $F(x_i)$. The regression line of $x$ on $K$ is determined by MLS. This gives estimates of the sample mean $\bar{x}$ and standard deviation $s_x$. Though this procedure is not theoretically exact, it generally gives a better overall fit than the MM and, further, the estimate is not affected very much by extremely rare occurrences as in the case of MM.
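The procedure can be sketched for the Normal distribution as follows. This is an illustration under stated assumptions rather than the MLS subroutine of FACHVES: the Weibull plotting position i/(n + 1) is assumed because the text does not name a specific formula, the exact inverse Normal distribution function from the Python standard library supplies the frequency factors, and the function name and flow values are hypothetical.

```python
from statistics import NormalDist, mean


def mls_normal_fit(data):
    """Least squares regression of x on the Normal frequency factor K.

    Returns (intercept, slope), which estimate the mean and the
    standard deviation of the fitted Normal distribution.
    """
    x = sorted(data)
    n = len(x)
    # Assumed Weibull plotting position for the i-th smallest value.
    F = [(i + 1) / (n + 1) for i in range(n)]
    # Frequency factor for the Normal distribution is the standard variate.
    K = [NormalDist().inv_cdf(p) for p in F]
    kbar, xbar = mean(K), mean(x)
    slope = sum((k - kbar) * (v - xbar) for k, v in zip(K, x)) / sum(
        (k - kbar) ** 2 for k in K
    )
    intercept = xbar - slope * kbar
    return intercept, slope


# Hypothetical annual flows, for illustration only.
flows = [310.0, 455.0, 390.0, 520.0, 610.0, 480.0, 350.0, 700.0, 430.0]
est_mean, est_std = mls_normal_fit(flows)
print(est_mean, est_std)
```

Because the regression uses all plotted points with equal weight, a single extreme observation changes the slope less than it changes the moment estimate of the standard deviation, which is the robustness property noted in the text.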

A plot between the frequency factor and the variable is made with the variable $x$ on the ordinate and the frequency factor $K$ on the abscissa. By definition, this will be a straight line. The frequency factors for different distributions are calculated as follows.

a) Normal distribution: The frequency factor can be expressed as $K_T = (x_T - \bar{x})/s_x$, where $\bar{x}$ and $s_x$ are the mean and standard deviation of the data and $K_T = Z$, which is known as the standard normal variate.

For the Log Normal distribution, the same procedure can be applied except that it is applied to the logarithms of the variables, and their mean and standard deviation are used.

b) Extreme value distribution: For the Extreme value type 1 distribution, Chow (1953) derived the expression

$$K_T = -\frac{\sqrt{6}}{\pi}\left\{0.5772 + \ln\left[\ln\left(\frac{T}{T - 1}\right)\right]\right\} \tag{2.28}$$

where $T$ is the return period. The value of $T$ is calculated as $T = 1/P(x)$ where $P(x)$ is the probability of exceedance obtained from the plotting position formula.

c) Log Pearson Type 3 distribution: For this distribution, the first step is to take the logarithms of the hydrologic data, $y = \log x$. Usually, logarithms to the base 10 are used. The mean $\bar{y}$, standard deviation $s_y$ and the coefficient of skewness $C_s$ are calculated for the logarithms of the data. The frequency factor depends on the return period $T$ and the coefficient of skewness $C_s$. When $C_s = 0$, the frequency factor is equal to the standard normal variate $Z$. When $C_s \ne 0$, $K_T$ is approximated by Kite (1977) as

$$K_T = Z + (Z^2 - 1)K + \frac{1}{3}(Z^3 - 6Z)K^2 - (Z^2 - 1)K^3 + ZK^4 + \frac{1}{3}K^5 \tag{2.29}$$

where $K = C_s/6$. The value of $Z$ is the standard normal variate for the given $T$.
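For the Extreme value type 1 case, the frequency factor depends only on the return period, so the estimate for a given return period follows directly from the sample mean and standard deviation. The sketch below is illustrative; the function names and the numerical values are assumptions.

```python
import math


def ev1_frequency_factor(T):
    """Extreme value type 1 frequency factor for return period T (T > 1)."""
    return -(math.sqrt(6.0) / math.pi) * (
        0.5772 + math.log(math.log(T / (T - 1.0)))
    )


def event_magnitude(mean, std, K):
    """Magnitude x_T = mean + K * std for a frequency factor K."""
    return mean + K * std


# Hypothetical annual maximum series: mean 100, standard deviation 30.
K100 = ev1_frequency_factor(100.0)
print(K100)
print(event_magnitude(100.0, 30.0, K100))
```

The factor is close to zero for a return period of about 2.33 years, which is the return period of the mean of an Extreme value type 1 variable.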

2.5.3 Method of Maximum Likelihood (MLE)

The maximum likelihood estimate is that estimate of the parameters of a distribution for which the probability of occurrence of the actual observations is a maximum.

Let $x_1, x_2, \ldots, x_N$ be the $N$ observations and $f(x)$ the assumed probability density function in terms of the parameters $P_1, P_2, \ldots, P_J$. Assuming independence of events, the probability of the outcome $P$ is given by

$$P = \prod_{i=1}^{N} f(x_i) \tag{2.30}$$

For this to be a maximum, $\partial P/\partial P_j = 0$ for all $j = 1, 2, \ldots, J$. The $J$ equations in terms of the $J$ parameters are solved to give the maximum likelihood estimates. For the Normal distribution, these estimates are the same as the estimates by the MM. For the two parameter Log Normal distribution, the MLE of the mean and the standard deviation are given by the mean and standard deviation of the logarithm of the sample.
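The two parameter Log Normal case can be illustrated numerically. The sketch below assumes natural logarithms and the divisor N in the standard deviation, which is what maximising the likelihood yields; the function names and data are hypothetical. The log-likelihood function is included only to show that moving either parameter away from the estimate lowers the likelihood.

```python
import math
from statistics import fmean, pstdev


def lognormal_mle(data):
    """Maximum likelihood estimates of the mean and standard deviation
    of ln X for a two parameter Log Normal distribution."""
    logs = [math.log(v) for v in data]
    return fmean(logs), pstdev(logs)  # pstdev uses the divisor N


def lognormal_log_likelihood(data, mu_y, sigma_y):
    """Sum of ln f(x_i) for the Log Normal density with parameters
    (mu_y, sigma_y); the logarithm of the product of the densities."""
    total = 0.0
    for v in data:
        y = math.log(v)
        total += -math.log(v * sigma_y * math.sqrt(2.0 * math.pi))
        total -= 0.5 * ((y - mu_y) / sigma_y) ** 2
    return total


# Hypothetical observations.
obs = [math.exp(1.0), math.exp(2.0), math.exp(3.0)]
mu_hat, sigma_hat = lognormal_mle(obs)
print(mu_hat, sigma_hat)
print(lognormal_log_likelihood(obs, mu_hat, sigma_hat))
```

Working with the sum of logarithms instead of the product avoids numerical underflow for large samples and has its maximum at the same parameter values.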


2.6 Identification of distribution

For preliminary selection of pdf's, statistical parameters are used. Generally, the arithmetic mean is believed to be the best central value parameter. The coefficient of skewness often acts as a regionalisation parameter and becomes very important when the mean and standard deviation have small variation. Often a combination of skew and kurtosis determines the pdf type. A guide to the selection of pdf in terms of kurtosis and skewness of the sample data is given in Fig 2.3 (Raudkivi, 1979). Some criteria for the initial selection of pdf based on estimated sample statistics are presented in Table 2.1.

The following notations are used:

$\mu$ = Population mean
$\sigma$ = Population standard deviation
$C_s$ = Coefficient of skewness of population
CV = $\sigma/\mu$ = Coefficient of variation
$C_k$ = Coefficient of kurtosis

Selection of pdf largely depends upon the type of hydrologic variable, viz., flood, drought, rainfall, etc. For low floods or droughts, Extreme value type 3 or Pearson type 3 distributions (Kite, 1977) will be suitable. For floods, Log Normal, Pearson type 3, Extreme value type 1 and Extreme value type 3 distributions are suitable. For rainfall, the Extreme value type 1 distribution is preferable (Chow, 1961). Similarly, for Exceedance series, the Negative Exponential distribution is well suited.


2.7 Outliers

An outlier in a set of data may be defined as an observation (or a subset of observations) which appears to be inconsistent with the remainder of that set of data. In picking out an observation (or a set of observations) as outliers, the main problem is in deciding whether or not some observations are genuine members of the main population. If they are not, then the attempts made to draw inferences about that population may be



Fig 2.3 Regions in the ($\beta_1$, $\beta_2$) plane for various distributions (Beta (U shaped), Beta (J shaped), Beta, Gamma, Exponential and Log-normal distribution regions), after E. S. Pearson. From Hahn and Shapiro (1967).

Table 2.1  Theoretical and Heuristic Parameter Ranges for Different Frequency Distributions

Case  Theoretically                    Heuristically          Suggestion
nos.

1.    Cs = 0.0                         -0.5 < Cs < 0.5
      CV < 0.4                         same
      Ck > 1.0                         same
      and
      i.    if Ck < 2.0                if Ck < 2.?5           Uniform distribution
      ii.   if Ck = 3.0                if ?.0 < Ck < 4.0      Normal distribution
      iii.  if Ck > 3.0                if 4.0 < Ck < 10.0     t distribution

2.    Cs < 0.0                         Cs <= -0.5             Pearson type III or lognormal
                                                              type III can be fitted. Priority
                                                              may be given to Pearson type III
                                                              on the basis of parsimony of
                                                              parameters.

3.    Cs > 0.0                         Cs > 0.5
      i.    Ck < (1.2?Cs + 1.0)        same                   No distribution
            or Ck <= 1.0                                      (impossible region)
      ii.   Ck = ?Cs + 3.0             ? < Ck < ?             lognormal distribution
      iii.  Cs = 1.14 and Ck = 5.4                            Extreme value type I distribution
      iv.   Ck = 3.0 + 6CV^2,                                 Gamma distribution
            where CV = 1/sqrt(N)
      v.    Cs = 4.0, Ck = 9.0                                Exponential distribution
      vi.   (1.0625Cs + 1.75) < Ck < (1.8125Cs + 1.75)        Beta (J) distribution
      vii.  (Cs + 1.0) < Ck < (1.0625Cs + 1.75)               Beta (U) distribution
      viii. (1.??Cs + 3.0) < Ck < (1.8125Cs + 1.75)           Beta or Gamma distribution

Note: Alternatively,
1. Input data can be log transformed to a Y series and then the transformed data can be tested for Normal, Pearson type III or Extremal type I distributions.


misleading. Hence, it becomes necessary to know the basic
distinction between Extreme observations, Outliers and
Contaminants.

2.7.1 Difference between Extremes, Outliers and Contaminants

Suppose there is a random univariate sample of size n,
x1, x2, ..., xn, from a distribution whose form is denoted by F.
The ordered sample (from smallest to largest) is x(1), x(2),
..., x(n). The observations x(1) and x(n) are the sample
Extremes. Whether to declare either of them an Outlier depends on
how they appear in relation to the postulated model F. In Fig
2.4(a), neither x(1) nor x(n) appears to be outlying. In Fig
2.4(b), x(n) is an upper outlier and x(1) also gives some cause
for concern; x(1) may be declared a lower outlier (so that x(1),
x(n) is an outlier pair; x(n-1), x(n) may be declared an upper
outlier pair). Hence, extreme values may or may not be outliers.
Any outliers, however, are always extreme (or relatively extreme)
values in the sample.

Suppose that not all the observations come from the
distribution F, but one or two come from a distribution G which
has slipped upward relative to F (i.e., it has a larger mean). The
observations from G are termed Contaminants. Such contaminants may
appear as extremes but need not do so. Fig 2.4(c) shows two
contaminants (marked in the figure), one of which is the upper
extreme; the other is in the midst of the sample. But x(n), whilst
an extreme and a contaminant, is not an outlier. In Fig 2.4(d),
however, a non-extreme contaminant is seen which is nonetheless
outlying as one of the outlier pair x(n-1), x(n). If the number of
observations from distribution G is not negligible in terms of the
total sample size n, it may be necessary to treat the distribution
as a mixed distribution, say of two distributions.

Hence outliers may or may not be contaminants, and
contaminants may or may not be outliers. There may not be any
method to know whether or not any observation is a contaminant. So
attention is concentrated on outliers as the possible
manifestation of contamination.




2.7.2 Origin of Outliers

In taking observations, different sources of variability
can be encountered. Three of these that can be distinguished are:

a) Inherent variability: This is the expression of the way in
which observations vary over the population; such variation is a
natural feature of the population. It is uncontrollable and
reflects the distributional properties of a correct basic model
describing the generation of the data. Thus, for example,
measurements of heights of men will reflect the amount of
variability indigenous to the population (and this may well be
reasonably modelled by a Normal distribution).

b) Measurement error: Often, physical measurements are taken on
members of a population under study. Inadequacies in the measuring
instrument superimpose a further degree of variability on the
inherent factor. The rounding off of obtained values, or mistakes
in recording, compound the measurement error; they are part of
it. Some control of this type of variability is possible.

c) Execution error: A further source of variability arises in
the imperfect collection of data. A biased sample may be chosen, or
individuals may be included that are not truly representative of
the population that is being sampled. Sensible precautions may
reduce such variability, but one may not be aware of the execution
errors, and sometimes it may be appropriate to change the basic
population model to encompass the prospect of anomalous sample
members.

2.7.3 Tests for Outliers

There are a number of approaches to identify and treat
outliers (Barnett, V. and Lewis, T., 1983). Some simple approaches
are given below.

i) Normal and Log Normal distributions: The following equation
can be used to detect high outliers:

$Y_H = \bar{y} + K_n s_y$ (2.31)

where $Y_H$ is the high outlier threshold and $K_n$ depends on the
sample size as given in Table 2.2. The $K_n$ values in Table 2.2
are used in one sided tests that detect outliers at the 10 % level
of significance in normally distributed data. If the values in the





Table 2.2

Outlier test Kn values

Sample          Sample          Sample          Sample
size n   Kn     size n   Kn     size n   Kn     size n   Kn

  10    2.036     24    2.467     38    2.661     60    2.837
  11    2.088     25    2.486     39    2.671     65    2.866
  12    2.134     26    2.502     40    2.682     70    2.893
  13    2.175     27    2.519     41    2.692     75    2.917
  14    2.213     28    2.534     42    2.700     80    2.940
  15    2.247     29    2.549     43    2.710     85    2.961
  16    2.279     30    2.563     44    2.719     90    2.981
  17    2.309     31    2.577     45    2.727     95    3.000
  18    2.335     32    2.591     46    2.736    100    3.017
  19    2.361     33    2.604     47    2.744    110    3.049
  20    2.385     34    2.616     48    2.753    120    3.078
  21    2.408     35    2.628     49    2.760    130    3.104
  22    2.429     36    2.639     50    2.768    140    3.129
  23    2.448     37    2.650     55    2.804

Source: U.S. Water Resources Council, 1981. This table contains
one sided 10-percent significance level Kn values for the normal
distribution.


sample are greater than $Y_H$ in Eq. 2.31, then they are considered
high outliers.

A similar equation can be used to detect low outliers:

$Y_L = \bar{y} - K_n s_y$ (2.32)

where $Y_L$ is the low outlier threshold in log units, and $\bar{y}$
and $s_y$ are the mean and standard deviation of the data.

According to the Water Resources Council (1981), if the
station skew is greater than +0.4, tests for high outliers are
considered first; if the station skew is less than -0.4, tests
for low outliers are considered first. Where the station skew is
between +0.4 and -0.4, tests for both high and low outliers
should be applied before eliminating any outliers from the data
set.

For data distributed log normally, the log
transformation transforms the data to Normal, and the above
procedure can then be applied to the logarithms.
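
The threshold test of Eqs. 2.31 and 2.32 can be illustrated with a minimal Python sketch. It is not the FACHVES subroutine; the function names, the small excerpt of Table 2.2 and the use of base-10 logarithms for the log normal case are assumptions made only for this illustration.

```python
import math
import statistics

# Excerpt of Table 2.2 (sample size -> K_n); the full table would be used in practice.
KN_TABLE = {10: 2.036, 20: 2.385, 30: 2.563, 50: 2.768, 100: 3.017}


def outlier_thresholds(values, kn, use_logs=False):
    """Return (Y_L, Y_H) from Eqs. 2.32 and 2.31.

    use_logs=True applies the test to log10 of the data (assumed base),
    as suggested for log normally distributed data.
    """
    y = [math.log10(v) for v in values] if use_logs else list(values)
    mean = statistics.mean(y)
    sd = statistics.stdev(y)  # sample standard deviation (n - 1 divisor)
    return mean - kn * sd, mean + kn * sd


def flag_outliers(values, kn, use_logs=False):
    """Return (low_outliers, high_outliers) in the original units."""
    low, high = outlier_thresholds(values, kn, use_logs)
    y = [math.log10(v) for v in values] if use_logs else list(values)
    lows = [v for v, t in zip(values, y) if t < low]
    highs = [v for v, t in zip(values, y) if t > high]
    return lows, highs
```

For a sample of size n, `flag_outliers(data, KN_TABLE[n])` returns the observations falling outside the two thresholds; the order in which high and low outliers are examined would still follow the station skew rule given above.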

ii) Pearson and Log Pearson Type 3 distributions: If the
original distribution is a Pearson distribution, it can be
transformed to a Normal distribution by the Hilferty - Wilson
transformation and then tested for outliers in the Normal
distribution. The procedure consists in first standardising the
raw series so that the resulting series has nearly zero mean and
unit standard deviation.

Let $x_i$ be the Pearson Type III standard deviate obtained
from the standardised data with skew coefficient $g$. Then, the
corresponding normal (0,1) deviate $t_i$ can be obtained by the
transformation

$t_i = (6/g) \left[ (g x_i / 2 + 1)^{1/3} - 1 \right] + g/6$ (2.33)

In case the probability distribution is a log Pearson
Type III distribution, then the log transformation is used,
followed by standardisation and then the Hilferty - Wilson
transformation, which transforms the distribution to a nearly
Normal distribution.
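
As a hedged illustration of the transformation of a Pearson Type III deviate to a near-normal deviate, the sketch below implements the standard Wilson-Hilferty form in Python. The function names are hypothetical, and the inverse mapping (the usual frequency factor approximation) is included only as a consistency check.

```python
import math


def cbrt(v):
    """Real cube root that also handles negative arguments."""
    return math.copysign(abs(v) ** (1.0 / 3.0), v)


def pearson3_to_normal(x, g):
    """Map a standardised Pearson Type III deviate x (skew g) to an
    approximately normal (0, 1) deviate."""
    if abs(g) < 1e-8:
        return x  # zero skew: the deviate is already (nearly) normal
    return (6.0 / g) * (cbrt(g * x / 2.0 + 1.0) - 1.0) + g / 6.0


def normal_to_pearson3(z, g):
    """Inverse mapping, used here only to check the forward transform."""
    if abs(g) < 1e-8:
        return z
    return (2.0 / g) * ((1.0 + g * z / 6.0 - g * g / 36.0) ** 3 - 1.0)
```

Once the standardised series has been mapped with `pearson3_to_normal`, the normal-distribution outlier thresholds can be applied to the transformed values.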

iii) Extreme value Type 1 and Type 3 distributions: The Extreme
value Type I distribution is first transformed to the Exponential
distribution and then tested for outliers in the Exponential
distribution.

If X has a Gumbel (extreme value) distribution, the
transformed random variable Y = exp(-x/b) has an Exponential
distribution with origin zero and scale parameter exp(-a/b), where

$b = s_x \sqrt{6} / \pi$ and (2.34)

$a = \bar{x} - 0.5772\, b$ (2.35)

So, if the value of b is known, the discordancy of an
outlier or a set of outliers in a sample from the X-distribution
can be tested by transforming each observed value $x_i$ to
$y_i = \exp(-x_i/b)$ and applying to the y's a discordancy test for
an exponential sample with origin zero.

In this transformation, an upper outlier x(n) in the
X-sample converts to a lower outlier y(1) in the Y-sample and
vice-versa.

To test for discordancy in the exponential sample, a
recursive procedure is adopted. A test is used to identify a
single upper outlier or lower outlier and is applied recursively
until all the outliers have been identified.

For high outliers, the test statistic $x_{(n)} / \sum x_j$ is
compared with the 5 % critical values given in Table 2.3, and for
low outliers, the test statistic $x_{(1)} / \sum x_j$ is compared
with the 5 % critical values given in Table 2.4, for a particular
length of data n. The observed value is declared discordant if the
test statistic is greater than the critical value in the case of
high outliers and less than the critical value in the case of low
outliers.

The sample data are now retransformed from the Exponential
distribution back to the original scale using the transformation

$x_i = -b \log y_i$ where (2.36)

$b = s_x \sqrt{6} / \pi$ (2.37)

For the Extreme value Type III distribution, the log
transformation is used first, followed by the above procedure,
which transforms it to the exponential distribution; it is then
tested for outliers.
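
The Gumbel to Exponential transformation and the two discordancy statistics can be sketched in Python as follows. This is an illustrative sketch with hypothetical function names; the critical values of Tables 2.3 and 2.4 are not reproduced here, so the comparison step is left to the caller.

```python
import math
import statistics

EULER_GAMMA = 0.5772  # constant used in Eq. 2.35


def gumbel_parameters(x):
    """Moment estimates of a and b from Eqs. 2.34 and 2.35."""
    b = statistics.stdev(x) * math.sqrt(6.0) / math.pi
    a = statistics.mean(x) - EULER_GAMMA * b
    return a, b


def to_exponential(x, b):
    """y_i = exp(-x_i / b): upper outliers in x become lower outliers in y."""
    return [math.exp(-xi / b) for xi in x]


def from_exponential(y, b):
    """Retransformation x_i = -b log(y_i) of Eq. 2.36."""
    return [-b * math.log(yi) for yi in y]


def discordancy_statistics(y):
    """Return (smallest / sum, largest / sum) for an exponential sample."""
    total = sum(y)
    return min(y) / total, max(y) / total
```

In a recursive application, the most extreme value would be removed after being declared discordant and the statistics recomputed on the reduced sample.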

2.8 Test for Goodness of Fit

The theoretical distribution that is fit to the given
sample data can be validated by goodness of fit tests. These
tests indicate the goodness of fit for a certain distribution at a
particular confidence level. The test used most frequently is the
Chi square test, as described below.

The sample space is divided into I mutually exclusive
classes with a class frequency of 5 or more. Let p(i) be the
probability that the variable belongs to the i-th class for the
assumed distribution. If x(i) and x(i+1) are the limits of the
i-th class interval, then

$p(i) = F[x(i+1)] - F[x(i)]$ (2.38)

Let f(i) be the observed frequency of the sample in the i-th
class. If N is the total number of samples, the chi square
statistic is given by

$\chi^2 = \sum_{i=1}^{I} \frac{[f(i) - N p(i)]^2}{N p(i)}$ (2.39)

If J is the number of parameters estimated, then theoretically
$\chi^2$ has a chi square distribution with (I-J-1) degrees of
freedom. Let $\chi^2(\alpha)$ denote the value of $\chi^2$ at the
α % confidence level for the above degrees of freedom, as obtained
from the tables. If the calculated $\chi^2$ is greater than the
theoretical value, then the sample deviates significantly from the
assumed distribution at the given confidence level and the fit is
rejected. If it is less, then the fit is accepted.
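
A minimal Python sketch of the Chi square computation is given below. The function names are hypothetical, and the theoretical (tabulated) value is assumed to be supplied by the caller for the chosen confidence level and degrees of freedom.

```python
def chi_square_statistic(observed, probabilities):
    """Eq. 2.39: sum over classes of [f(i) - N p(i)]^2 / (N p(i))."""
    n = sum(observed)
    return sum((f - n * p) ** 2 / (n * p)
               for f, p in zip(observed, probabilities))


def degrees_of_freedom(n_classes, n_params):
    """I - J - 1, with I classes and J estimated parameters."""
    return n_classes - n_params - 1


def fit_accepted(chi2, tabulated_value):
    """The fit is accepted when the calculated value is less than the tabulated one."""
    return chi2 < tabulated_value
```

For example, five equiprobable classes with observed counts 18, 22, 20, 25 and 15 would be compared against the tabulated value for 2 degrees of freedom when two parameters have been estimated.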

2.9 Confidence Bands

Statistical estimates are often presented with a range
or confidence interval, within which the true value can reasonably
be expected to lie. The size of the confidence interval depends on
the confidence level β. The upper and lower values of the
confidence interval are called confidence limits.

Corresponding to the confidence level β is a
significance level α, given by α = (1-β)/2. For estimating the
event magnitude for the return period T, the upper limit
$U_{T,\alpha}$ and lower limit $L_{T,\alpha}$ for a Normal
distribution are given by

$U_{T,\alpha} = \bar{y} + s_y K^{U}_{T,\alpha}$ and (2.40)

$L_{T,\alpha} = \bar{y} + s_y K^{L}_{T,\alpha}$ (2.41)

where $K^{U}_{T,\alpha}$ and $K^{L}_{T,\alpha}$ are the upper and
lower confidence limit factors, which can be used for the Normal
distribution and the Pearson Type 3 distribution. Approximate
values for these factors are given by

$K^{U}_{T,\alpha} = (K_T + \sqrt{K_T^2 - ab})/a$ and (2.42)

$K^{L}_{T,\alpha} = (K_T - \sqrt{K_T^2 - ab})/a$ in which (2.43)

$a = 1 - z_\alpha^2 / (2(n-1))$ and (2.44)

$b = K_T^2 - z_\alpha^2 / n$ (2.45)

The quantity $z_\alpha$ is the standard normal variable with
exceedence probability α. The same factors are used to construct
approximate confidence limits for the Extreme value Type 1
distribution.
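
The approximate confidence limit factors can be computed as in the following Python sketch. It is an illustration under the usual form of these approximations, not the CONFBND subroutine; the function names are hypothetical and the frequency factor K_T is assumed to be supplied by the caller.

```python
import math
from statistics import NormalDist


def confidence_limit_factors(k_t, n, beta):
    """Return (K_upper, K_lower) for frequency factor k_t, sample size n
    and confidence level beta (e.g. 0.90)."""
    alpha = (1.0 - beta) / 2.0
    # Standard normal variable with exceedence probability alpha.
    z = NormalDist().inv_cdf(1.0 - alpha)
    a = 1.0 - z * z / (2.0 * (n - 1))
    b = k_t * k_t - z * z / n
    root = math.sqrt(k_t * k_t - a * b)
    return (k_t + root) / a, (k_t - root) / a


def confidence_limits(mean, sd, k_t, n, beta):
    """Return (lower limit, upper limit) of the event magnitude."""
    k_upper, k_lower = confidence_limit_factors(k_t, n, beta)
    return mean + sd * k_lower, mean + sd * k_upper
```

Evaluating `confidence_limits` over a range of return periods gives the pair of curves that bound the fitted distribution on a frequency plot.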


2.10 Expert Systems in Frequency Analysis

There is no general agreement among hydrologists on the 
specific choice of any particular theoretical distribution for 
frequency analysis of a given hydrologic variable at a given site. 
Frequency analysis, being a data dependent technique, is subject to
many constraints, limitations and assumptions.

The data are assumed to be consistent, homogeneous and
independent, which in actual practice need not be true. The maximum
floods are seldom, if ever, measured correctly due to uncertainty
in occurrence time and lack of proper gauging facilities as well
as rating curves at the proper time and place. As a result they
are estimated from the extrapolation of rating curves and other
techniques. These estimates cause inconsistency in data with
possible large errors. There are many more constraints, like the
variable type, sample size, outliers in data, and nonstationarity
with respect to the process involved, which have to be taken care
of.

These problems, if not dealt with proper attention and
with an intelligent, knowledge rich heuristic approach, will often
lead to misleading results. Hence, domain specific human expertise
can be effectively used to tackle these problems. The ES may take
care of these problems to a large extent and fit a suitable
theoretical probability distribution function to observed data.
3.1 Introduction

Human experts in any field are frequently in great
demand and are also generally in short supply. AI presents a
solution to such a problem through an expert system (ES), which is
a computing system capable of representing and reasoning about
some knowledge rich domain with a view to solving problems and
giving advice (Jackson, 1986). It is also known as a Knowledge based
expert system (KBES).

Gaschnig (1981) defines a KBES as an 'interactive computer
program incorporating judgement, experience, rules of thumb,
intuition and other expertise to provide knowledgeable advice
about a variety of tasks'. So, an ES acts as an intelligent
assistant to a human expert and also provides assistance to people
who otherwise might not have access to expert advice.


3.2 Building Expert System 
3.2.1 Architecture of KBES

The KBES generally has four principal components (Fig
3.1). They are a knowledge base, working memory, inference engine
and a user interface. As KBES vary in design, they may have other
components also, e.g., graphics, system analysis and other
software.

a) Knowledge base: A knowledge base contains both declarative
knowledge (facts about objects, events and situations) and
procedural knowledge (information about courses of action), which
may be scientific, analytic or heuristic rules (Fig 3.2). Although
many knowledge representation techniques have been used in ES, the
most prevalent form of knowledge representation currently used in
ES is the 'rule based production system' approach. The rules
have generally two parts, conditions and actions. The rules are
fired when the conditions are matched with the facts. The actions
can be processing instructions or control instructions. The
rules may include metarules, which are rules about rules.
Fig 3.1 ARCHITECTURE OF A TYPICAL ES

Fig 3.2 THE COMPONENTS OF THE KNOWLEDGE BASE OF AN
ES (Schank and Childers, 1984)
b) Working memory: It is the currently active part of the
knowledge base and may include a knowledge management module.

c) Inference engine: It is the most crucial component of the ES
since it matches and manipulates the database for problem solving.
It is the inference mechanism which also provides justification
for the ES. Three formal approaches used in this case are
production rules, structured objects and predicate logic.
Production rules consist of a rule set, a rule interpreter which
specifies when and how to apply the rules, and working memory that
holds data, goals or intermediate results. Structured objects use
vector representation of essential and accidental properties.
Predicate logic uses propositional and predicate calculus.

d) User interface: It is the communication module which
provides bidirectional exchange of information between the user
and the system.

3.2.2 ES techniques

The order of execution of the rules and/or procedures in
an ES is governed by the inference engine in terms of the problem
solving strategy used. Maher (1986) considers two approaches.

a) The derivation approach: It involves deriving a solution
that is most appropriate to the problem from a list of predefined
solutions stored in the knowledge base of the ES. It includes
forward chaining (or data driven control strategy), backward
chaining (or goal driven strategy) and a hybrid strategy
combining both these strategies. Forward chaining works from an
initial state of known facts to the goal state, and backward
chaining works from a hypothetical goal state to the facts, perhaps
in terms of subgoals. The subgoals are preconditions for the goal
stated. If the hypothesis is not supported by facts, it tests for
another goal state and so on in a predefined order of goals.

b) The formation approach: It involves forming a solution from
eligible solution components stored in the knowledge base. It
includes problem reduction (into subproblems), plan-generate-test
(which generates all possible solutions, prunes inconsistent
solutions and tests the remaining solutions), and agenda control.
In agenda control, a priority rating is assigned to each task in
the agenda and the tasks are performed according to the assigned
priority.

These techniques may be combined with other techniques
for hierarchical planning, least commitment, backtracking and
constraint handling (Maher, 1986). Some other techniques available
include inductive inference, metareasoning, ill structured problem
and data handling, etc.
3.2.3 Developing an ES

Developing an ES is time consuming team work.
Particularly, for developing a sophisticated ES, an intensive and
coherent effort is required. Knowledge engineers and domain
experts work together to design an ES. The knowledge engineer
develops the expert system and the domain expert provides the
information for the knowledge base. Hayes-Roth et al (1983) have
identified five sequential stages in the development of an ES, shown
in Fig 3.3. Each stage is iterative in nature. Some of the stages
are shown in Fig 3.4. They are self explanatory.
3.2.4 ES Tools

A wide variety of development tools and environments are
available for ES. These tools can be one of the general purpose
programming languages or an ES shell. An ES shell is a set of ES
development programs containing no knowledge about a problem, but
which can be taught in one particular field or another. They
contain all the modules required for an ES. Filling of their hollow
knowledge bases makes them knowledge based expert systems. Broadly,
the ES tools are categorised as

1. Programming languages like PASCAL, C,

2. AI based exploratory programming languages like LISP and
PROLOG,

3. ES shells like VIDHI which may be based on item 2,

4. High level ES programming environments like OPS5, ART, KES,
NEXPERT, PC PLUS, RULEMASTER, CLIPS etc., and

5. Mixed programming environments which allow the programmer to
mix programming languages as in item 1 and item 2 with a high
level ES programming environment as in item 4.
3.3 CLIPS as an ES Shell
3.3.1 Introduction

CLIPS ('C' Language Integrated Production System) is a
rule based, forward chaining, mixed programming expert system shell
developed by the National Aeronautics and Space Administration
(NASA), Houston, Texas. The source program of CLIPS (version 4.3) is
written in C language. The primary representation methodology is a
forward chaining rule language based on the Rete algorithm (Forgy,
1982) for multiple pattern matching and conflict resolution. CLIPS
is a versatile software in that it can be incorporated with any
other language like FORTRAN and C. CLIPS can be used for
developing a KBES either as an Interactive executable element or
as an Embedded executable element.
3.3.2 Knowledge representation

a) Facts: A fact represents a piece of information and is
placed in a current list of facts, the fact-list. It acts as a
pattern for matching the conditions of a rule in order to fire
that particular rule. Facts can be added (asserted) in two ways,
viz., prior to execution and by the action of a rule firing. A
fact can be a number, a word, or a string. Facts are defined using
the deffacts construct.

b) Rule: The primary method of representing knowledge in CLIPS
is a rule. A rule is a collection of actions to be taken if the
conditions are met. The conditions are patterns which act as
constraints and also provide a way to describe how to solve a
problem. Rules are defined using the defrule construct.

A rule is made up of an LHS part and an RHS part. The LHS of
a rule is made up of a series of one or more patterns which
represent the condition element of the rule. In the RHS, a list of
the actions to be performed when the LHS of the rule is satisfied is
given. The arrow (=>) separates the RHS from the LHS. Two important
features of the RHS actions are the use of the 'if-then-else'
structure and the 'while' structure.

Further details on facts and rules available in CLIPS
are given in the CLIPS manual (Culbert, 1987).
3.3.3 Inference Engine

The inference engine of CLIPS is rule based and, as
indicated earlier (Sec 3.2.1), works on a forward chaining
inference mechanism. It is based on the Rete algorithm (Forgy, 1984)
for multiple pattern matching and conflict resolution.

3.3.4 Cycle of Execution 

In CLIPS, the starting and stopping points are not 
explicitly defined The inference engine applies the knowledge 
(rules) to the data (fact). The cycle of execution is as follows : 

1. The knowledge base is examined to see if the conditions of any
rule have been met.

2. All rules whose conditions are currently met are activated and
placed on the agenda (a stack). Rules having higher priority are
kept on the top of the stack and are activated before a new rule.
Rules having lower or equal priority remain below the new rule.

3. The top rule on the agenda is selected and its RHS actions are
executed.

As a result of RHS actions new rules can be activated
and/or deactivated.

This cycle is repeated until all rules that can fire
have done so, or until a so called rule limit is reached.
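
The cycle of execution can be illustrated with a small forward chaining loop in Python. This is a simplified sketch, not CLIPS itself: it rebuilds the agenda on every cycle instead of using the Rete algorithm, lets each rule fire at most once, and uses hypothetical rule and fact names chosen only for the example.

```python
def run(rules, initial_facts, rule_limit=100):
    """Fire rules until none is activated or the rule limit is reached.

    Each rule is a tuple (name, priority, conditions, conclusions),
    where conditions and conclusions are sets of fact strings.
    """
    facts = set(initial_facts)
    fired = []
    already_fired = set()
    while len(fired) < rule_limit:
        # Steps 1 and 2: find rules whose conditions are met and
        # order the agenda with the highest priority on top.
        agenda = [r for r in rules
                  if r[0] not in already_fired and set(r[2]) <= facts]
        if not agenda:
            break
        agenda.sort(key=lambda r: r[1], reverse=True)
        # Step 3: fire the top rule; its actions assert new facts,
        # which may activate further rules on the next cycle.
        name, _, _, conclusions = agenda[0]
        facts |= set(conclusions)
        already_fired.add(name)
        fired.append(name)
    return facts, fired
```

With a high priority rule that asserts `data-adequate` and lower priority rules that need that fact, the loop fires the adequacy rule first and then whichever distribution rule matches the remaining facts.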

3.4 Embedded application of CLIPS 

CLIPS has an added advantage of being integrated with 
external functions and/or C, FORTRAN or Ada language programs. 
This capability makes its application very flexible and more 
suited to engineering applications where numerical computations 
are of great importance, and where computer programs in any of the
languages are already available.
3.4.1 External function 

An external function defined by the user for his
specific use in problem solving can be either in C or in the
language within which CLIPS is being embedded. An external
function can be used in both the LHS and RHS of the rules. Data
can be passed to and from them.


The other method of passing data, asserting a new fact
directly into the CLIPS fact list, is done by calling the C
function assert.

CLIPS provides some more advanced interface functions with
which passing known variable types, accessing multifield
variables, and building facts from scratch (to assert a large number
of facts) are possible (Culbert, 1987).

3.4.2 Embedded application

a) General: CLIPS is designed to be embedded within other
programs. The embedding program can be a C language program, a
FORTRAN program or an Ada program. In each case a main program is
provided by the user which calls CLIPS like any other subroutine.
The basic changes which are to be made to access CLIPS from a main
program in a different language are available (Culbert, 1987;
Mishra, 1990).

b) FORTRAN - CLIPS interaction

For complete language mixing four basic capabilities are
needed:

1) A program in another language may be used as the main program
and CLIPS can be called as needed for reasoning,

2) Facts can be asserted into CLIPS from other languages,

3) CLIPS may call other functions written in any language from
the RHS of a rule and may pass parameters to the function, and

4) In languages which can provide a meaningful return value,
external functions may be called from the LHS of a rule (i.e.,
used as predicate functions).

The main program written in FORTRAN initialises CLIPS,
loads the rule files, resets the process, asserts the facts and
runs the program by calling specialised CLIPS functions. The
functions are init_clips, load_rules, reset_clips, assert and
run_clips, in sequence. In order to load a rule file or to assert
a fact, the FORTRAN strings are to be converted to C-strings by
calling a function storec. Similarly, for the reverse action, i.e.,
to pass a parameter from CLIPS to FORTRAN, the C-string is to be
converted into a FORTRAN string by calling a function loadc.

Facts can be asserted to CLIPS from FORTRAN either as
constants or as variables.
These functions are defined under 'usrfuncs' either in
file 'main.c' (in interactive mode) or any other file (in embedded
application). Within usrfuncs a call should be made to the
define_function routine for every function about which the user
wants CLIPS to know. User defined functions are searched before
system functions, and if a name matches one of the defined
functions already provided, the user function will be executed in
its place.

a) Passing variable from CLIPS to external function 

CLIPS actually calls the function without any arguments,
though they are listed directly following a function name inside
CLIPS rules. Instead the parameters are stored internally by CLIPS
and can be accessed by calling the functions

    int num_args();
    char *rstring(arg);
    float rfloat(arg);
    int runknown(arg);
    int arg;

A call to num_args will return an integer telling how
many arguments the function was called with.

A call to rstring returns a character pointer and rfloat
returns a floating point number. The parameters have to be
requested one at a time by specifying the parameter position
number as the argument to rstring or rfloat. If the type of the
argument is unknown, runknown can be called to determine the
type.

b) Passing data from external function to CLIPS 

An external function can pass data into CLIPS in two
ways. It can return a value or can assert a new fact directly into
the CLIPS fact_list. If the external function is to be used as a
predicate, it must return a floating point number; otherwise it
can be a character, integer, word or unknown. Return values can be
used as predicates, bound to variables or captured via pattern
recognition. The return values do not have to be captured, but
must be defined in CLIPS, and all external functions must return a
value.



3.5 Structure of FACHVES program
3.5.1 Introduction 

Frequency analysis of continuous hydrologic variables
embedding expert system (FACHVES) is an interactive FORTRAN
program with an embedded expert system. The program consists of a
main program, subroutines, ES interface and a rule based knowledge
base.


3.5.2 Environment

FACHVES programs have been developed, implemented and
tested in the VAX-VMS environment. However, it can be implemented in
the HP 9000-800 UNIX environment or on an IBM PC with slight
modifications.

3.5.3 Main Program

The main program of FACHVES is a menu based program
consisting of one master menu and eight submenus.

The master menu can call any of the eight submenus
depending upon the option of the user. The submenus are basically
for preliminary analysis, method of analysis, choice of
distribution, transformations, goodness of fit test, plotting of
results and ES interface. Each submenu consists of several
options and, depending upon the user's choice, the desired
subroutine can be called. However, the user is supposed to create
an input file or give input data interactively before calling the
master menu for the first time. Options for seasonal data analysis
are displayed in the master menu only when the data is seasonal.

3.5.4 Subroutines

In FACHVES, general subroutines already available in
FORTRAN for statistical analysis and some subroutines, e.g., for
two step power transformation, have been incorporated. They can be
called either from the main program or from other subroutines, as
per requirement. The subroutines are basically for

a) fitting different distributions

b) applying different goodness of fit tests

c) transforming the data using different transformations

d) using different methods of analysis

e) linking the FORTRAN program to CLIPS and

f) converting strings from FORTRAN to C and vice versa

The subroutines are programmed in such a way that they
perform one or a combination of the above functions. The
limitations in using a subroutine, if any, are conveyed to the
user interactively during execution.

One of the subroutines, PTHELP, acts as the ES interface and
it has been incorporated as an optional calling subroutine from
the FORTRAN main program. The subroutine PTHELP links the main
program with the CLIPS function calls and the string conversion
function STOREC. The suggestions given by the ES are converted to
FORTRAN strings by the LOADC function and stored in a subroutine
ADVICE, which subsequently passes them to the main program.

3.5.5 Knowledge Base

The main function of the ES is to supplement the
computational procedures specified in the main program with a
number of advices for choosing the right alternatives for
analysing the data. Without the ES knowledge base (KB) a similar
task could have been accomplished, however, only by going through
an exhaustive process of all the available alternatives. The ES
knowledge base helps to eliminate some of the nonfeasible
alternatives of the pdf fitting exercise. It also incorporates
subjective judgements inbuilt in the KB, or provided by the user,
to choose the right path of evaluation.

The ES is embedded in the main program and accessed by 
it during execution to accomplish the following tasks 

i) provide an interface for user supplied information in the
decision making process,

ii) transfer appropriate parameters to the FORTRAN main
program, based on the advice of the ES,

iii) advise on the adequacy of the data for statistical
analysis,

iv) advise on the initial choice of distribution functions
based on estimated values of statistical parameters,

v) advise on the suitable choice of transformations based on
the estimated values of statistical parameters, and

vi) advise on methods of dealing with outliers in statistical data.

The rule based ES's knowledge base consists of a set of
rules that reflect expert knowledge in this field. It also
provides a vehicle for incorporating subjective judgement in the
selection of procedures for statistical analysis while guiding the
user through various stages of decision making. The flow chart
showing the various components and functions of the ES knowledge
base is shown in Fig 3.5.

The knowledge base is accessed during the execution of
the main program. The inputs provided to the knowledge base
directly from the main program include the estimated values of
the statistical parameters of the raw hydrologic data and the
length of the record. Based on these inputs, the ES guides the
user through a series of steps, where existing patterns (rules) in
the KB are matched with the patterns constructed through user
input and given input information. Matching or non-matching of
these patterns leads to either a set of parameters or control
variables being passed on to the main program or the testing of a
subsequent rule.
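The pattern matching described above can be sketched as a pair of if-then rules acting on a set of facts. This is an illustration only: the rule wording follows the advice on outliers shown in the interactive session of Table 5.2, while the function name and the dictionary representation of facts are assumptions and do not reproduce the CLIPS rule base.

```python
def advise(facts):
    """Match simple rules against a dict of facts; return updated facts and advice."""
    known = dict(facts)          # work on a copy, leave the caller's facts untouched
    suggestions = []

    # Rule: with no information ("nk" = not known), assume no outliers.
    if known.get("outliers", "nk") == "nk":
        known["outliers"] = "no"
        suggestions.append("outliers initially assumed not to be present")

    # Rule: the estimation method depends on the outlier fact.
    if known["outliers"] == "yes":
        suggestions.append("use method of least squares for parameter estimation")
    else:
        suggestions.append("use method of moments for parameter estimation")
    return known, suggestions


if __name__ == "__main__":
    for answer in ("nk", "yes"):
        print(answer, advise({"type of data": "a", "outliers": answer})[1])
```

Each rule either adds a fact that later rules can match or produces a suggestion, which mirrors the matching and non-matching behaviour described in the paragraph.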

Once all the existing rules compatible with the facts
asserted by the user or generated by the actions of rules have
been tested, all the parameters and control variables specified
by the ES are transferred to the main program. Also, a list of
suggestions on the subsequent procedures of statistical analysis
is displayed by the ES to help the user. Whenever the user needs
some advice, the ES knowledge base can be accessed from the main
program by using either the master menu or the submenu
PRELIMINARY ANALYSIS.

CHAPTER 4


DESCRIPTION OF FACHVES Ver 2.0

4.1 Modifications to FACHVES

FACHVES indicated a general structure and included some
programs for analysis. A number of modifications and additions
were made to the program and they are given below.

4.1.1 Plotting of results

a) Histogram for the data 

A histogram is drawn for the given data with the
frequency on the ordinate and the class interval on the abscissa.
The given data are divided into eight class intervals. VAX
in-built subroutines such as Draw line and Draw rectangle have
been used to draw the histogram. The histogram comprises a series
of combinations of rectangles and straight lines. For example, the
routine

SMG$DRAW_LINE (DISPLAY1, 18, 7, 18, 60)

draws a horizontal line on the screen from the 7th column to the
60th column in the 18th row. Similarly, the routine

SMG$DRAW_RECTANGLE (DISPLAY1, R1, C1, R2, C2)

where R1, C1 correspond to the co-ordinates of the top left hand
corner of the rectangle and R2, C2 to the bottom right corner of
the rectangle, draws a rectangle on the screen.
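The class frequencies behind such a histogram can be sketched as follows. The sketch assumes eight equal-width classes spanning the range of the data and uses a text bar in place of the screen rectangles; the function name is hypothetical and the class limits actually used by FREQPL are not stated in the text.

```python
def class_frequencies(data, n_classes=8):
    """Count how many values fall in each of n_classes equal-width intervals."""
    lo, hi = min(data), max(data)
    if hi == lo:
        raise ValueError("all values are equal; class width would be zero")
    width = (hi - lo) / n_classes
    counts = [0] * n_classes
    for x in data:
        # The maximum value would land in class n_classes, so clamp it.
        k = min(int((x - lo) / width), n_classes - 1)
        counts[k] += 1
    return counts


if __name__ == "__main__":
    sample = [3.0, 4.5, 5.1, 5.2, 6.8, 7.7, 9.9, 12.0, 15.5, 19.0]
    for k, c in enumerate(class_frequencies(sample), start=1):
        print(f"class {k}: " + "#" * c)
```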

b) Cumulative distribution function 

The cumulative distribution functions for different
distributions are drawn on the screen using a plotting subroutine.
The data points are plotted row wise on the screen.

The cumulative distribution function is calculated from
the plotting positions. The plotting positions are first
calculated for different distributions. Most plotting position
formulae are represented by the following form:

$$P(X > x_m) = \frac{m - b}{n + 1 - 2b} \qquad (4.1)$$

where b is a parameter, n is the total number of data values and m
is the rank of an observation when the data are arranged in
descending order. The value of b varies for different
distributions. For example, for normally distributed data, the
Blom (1958) plotting position is used with b = 3/8, while for
data distributed according to the Extreme Value Type 1
distribution, the Gringorten (1963) formula (b = 0.44) is the
best. For the log Pearson Type 3 distribution, the optimal value
for b depends on the value of the coefficient of skewness, being
larger than 3/8 when the data are positively skewed and smaller
than 3/8 when the data are negatively skewed. However, a value of
b = 3/8 has been used for the Log Pearson Type 3 distribution. The
same plotting positions can be applied to the logarithms of the
data when using the log normal distribution.
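Equation (4.1) is short enough to check numerically. The sketch below evaluates it for every rank, with rank 1 assigned to the largest value; the function names are hypothetical.

```python
def plotting_positions(n, b):
    """Exceedance probabilities P(X > x_m) for ranks m = 1..n (rank 1 = largest)."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return [(m - b) / (n + 1 - 2 * b) for m in range(1, n + 1)]


def non_exceedance(p_exceed):
    """Cumulative probabilities F(x) = 1 - P(X > x) for each plotting position."""
    return [1.0 - p for p in p_exceed]


if __name__ == "__main__":
    blom = plotting_positions(28, 3 / 8)        # b = 3/8
    gringorten = plotting_positions(28, 0.44)   # b = 0.44
    for m in (1, 2, 28):
        print(m, round(blom[m - 1], 4), round(gringorten[m - 1], 4))
```

With n = 28 and b = 3/8, ranks 2 and 28 give 0.06 and 0.98 to two decimals, which agrees with the P(I) column printed in the interactive session of Table 5.2.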

The cumulative distribution function is now calculated
as

$$F(x) = P(X \le x) = 1 - P(X > x) \qquad (4.2)$$

The curve for the function is an S-curve which varies from 0 to
1. In the graph, F(x) is plotted on the ordinate and the variable
X on the abscissa.

c) K-X relationship

A plot of the frequency factor (K) as the abscissa against
the variable (X) as the ordinate is also drawn. By definition,
this will be a straight line. The frequency factors are
calculated for different distributions (Section 2.5).
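The straight line follows from the general frequency factor relation, x = mean + K * (standard deviation). The sketch below illustrates it for the normal distribution only, where K is the standard normal variate; the function names are hypothetical, and the frequency factors of the other distributions (Section 2.5) are not reproduced here.

```python
from statistics import NormalDist


def normal_frequency_factor(p_exceed):
    """Frequency factor K of the normal distribution for an exceedance probability."""
    if not 0.0 < p_exceed < 1.0:
        raise ValueError("exceedance probability must lie strictly between 0 and 1")
    return NormalDist().inv_cdf(1.0 - p_exceed)


def event_magnitude(mean, stdev, k):
    """General frequency factor relation: x = mean + K * stdev."""
    return mean + k * stdev


if __name__ == "__main__":
    # Illustrative mean and standard deviation, not values from the thesis data.
    for p in (0.5, 0.1, 0.01):
        k = normal_frequency_factor(p)
        print(p, round(k, 3), round(event_magnitude(100.0, 20.0, k), 1))
```

Because x is linear in K, the intercept of the plotted line is the mean and its slope is the standard deviation.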

d) Confidence bands 

Confidence bands for the 95% confidence level are drawn
about the fitted distribution. The upper and the lower confidence
limits for the fitted distribution are calculated (Section 2.9).
The confidence limits are represented by '*' in the plot.

e) Fitted distribution 

The fitted distribution is also drawn in the plot, where it
is represented by '&'.

4.1.2 Tests for outliers

The tests for outliers in the given data are conducted
as explained in Section 2.7. When an outlier is detected, it is
considered to be a rare event that has occurred as historical
data. This means that it is not discarded but that its plotting
position computed in the normal way is considered to be
appropriate, while the value is not. Hence, these rare events
should be considered with care when visually evaluating the
goodness of fit for the probability plots.

4.2 Structure

The modified version of FACHVES (Frequency Analysis of
Continuous Hydrologic Variables), FACHVES Ver 2.0, is an
interactive menu based program with an embedded expert system to
provide decision making support for frequency analysis of
hydrologic data. The program essentially consists of four parts:
a main program, subroutines, an ES interface and a rule based
knowledge base. In addition to the FACHVES program, the
subroutines OUTLIER, MLS, CONFBND, FREQPL, PRPLOT, TPLOT and
GOFITST have been introduced. In the main program, an additional
menu OUTLIER TESTS has been added to the menu PRELIMINARY
ANALYSIS.

Subroutine OUTLIER tests for the presence of outliers, if
any, and identifies the number and values of outliers present in
the data for any specific distribution. In case no outlier is
present, it sets the number of high and low outliers to zero and
these are passed on to the main program.

Subroutine MLS calculates the method of least squares
parameter estimates for a given distribution. Given a specific
distribution, it estimates the plotting position by using a
plotting position formula, determines the frequency factor and
fits a linear regression equation between the variable and the
frequency factor. The subroutine FREQPL plots the histogram for
the given data, the subroutine TPLOT plots the cumulative
distribution function and the subroutine CONFBND uses a subroutine
PRPLOT to plot the fitted distribution and confidence bands in
terms of the variable and the frequency factor. The subroutine
GOFITST compares observed and theoretical Chi square values to
determine the goodness of fit of the frequency distribution.
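The three steps attributed to MLS (plotting position, frequency factor, linear regression) can be sketched for the normal distribution as below. This is not the MLS subroutine of Appendix A: the function name, the default b = 3/8 and the `exclude_ranks` argument are assumptions. The last of these mimics the treatment of outliers described in Chapter 5, where the value of the outlier is neglected but the plotting positions of the remaining ranks are retained.

```python
from statistics import NormalDist


def mls_normal(data, b=0.375, exclude_ranks=()):
    """Regress the variable on the normal frequency factor.

    Returns (intercept, slope), which estimate the mean and the
    standard deviation. Ranks in exclude_ranks (1 = largest value)
    are left out of the regression but still counted in n.
    """
    xs = sorted(data, reverse=True)
    n = len(xs)
    std_normal = NormalDist()
    points = []
    for m, x in enumerate(xs, start=1):
        if m in exclude_ranks:
            continue
        p_exceed = (m - b) / (n + 1 - 2 * b)        # plotting position
        k = std_normal.inv_cdf(1.0 - p_exceed)       # frequency factor
        points.append((k, x))
    if len(points) < 2:
        raise ValueError("at least two points are needed for a regression")
    k_mean = sum(k for k, _ in points) / len(points)
    x_mean = sum(x for _, x in points) / len(points)
    s_kx = sum((k - k_mean) * (x - x_mean) for k, x in points)
    s_kk = sum((k - k_mean) ** 2 for k, _ in points)
    slope = s_kx / s_kk
    return x_mean - slope * k_mean, slope


if __name__ == "__main__":
    # Illustrative values only, not data from the thesis.
    sample = [12.1, 15.3, 9.8, 30.5, 11.0, 13.7, 10.4, 14.2]
    print(mls_normal(sample))
    print(mls_normal(sample, exclude_ranks=(1,)))
```

Comparing the two printed estimates shows how much a single large value pulls the fitted line when it is kept in the regression.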

Except for the above modifications, which define Ver
2.0, the structure of the program is the same as given by Mishra
(1990). The flow chart of the execution of the program, flow chart
of the main program, details of the submenus and the list of
subroutines are given in Figs 4.1 to 4.3 and Table 4.1
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MASTER MENU 
Table 4.1

List of Main Subroutines

Subroutine    Function

ADVICE        Passes ES advice to main program
CHISQRT       Chi-square test
CONFBND       Plots the confidence bands, the fitted
              distribution in terms of the variable and
              frequency factor using the PRPLOT subroutine
FAP           Fits different distributions for seasonal
              data, calculates seasonal variations and tests
              the goodness of fit (Chi-square test)
FREQPL        Plots the histogram for the given data
GOFITST       Compares the observed and theoretical
              Chi-square values
IPT           Transforms the data by Inverse Pearson
              transformation
LN2           Fits a 2 parameter Log Normal distribution and
              estimates T year events
LN3           Fits a 3 parameter Log Normal distribution and
              estimates T year events
LOADC         Converts a C string to FORTRAN string
LOGTRAN       Transforms the data logarithmically
LP3           Fits a Log Pearson Type 3 distribution and
              estimates T year events
MLS           Calculates the method of least squares
              parameter estimates for a given distribution
ND            Fits a Normal distribution
OUTLIER       Identifies the number and the values of the
              outliers present in a given distribution
PARAM         Statistical parameter calculation
PRPLOT        Plotting subroutine
PTHELP        ES interface, calls ES program
PT3           Fits a Pearson Type 3 distribution and
              estimates T year events
SER           Computation of the standard errors of events
              computed from various probability distributions
              compared to the observed event magnitude
SQTRAN        Transforms the data by Square root
              transformation
STOREC        Converts a FORTRAN string to C string
T1E           Fits a Type 1 Extremal distribution and
              estimates T year events
T3E           Fits a Type 3 Extremal distribution and
              estimates T year events
TPLOT         Plots the cumulative distribution function
TSPT          Transforms the data by Two step power
              transformation

respectively. The subroutines MLS, OUTLIER and FREQPL are given in
Appendix A.

4.3 Introduction to the use of FACHVES Ver 2.0
4.3.1 Estimation of Parameters

The parameters are estimated at two stages during the
execution of the expert system. They are first estimated during
the preliminary analysis. At this stage, parameters are estimated
by the method of moments. In the second stage of estimation, the
user chooses the preferred method of parameter estimation. This
is done after a suitable distribution has been finalised
following the preliminary analysis and after obtaining
preliminary advice from the expert system, wherein all the
possible distributions are indicated.

The parameters can be estimated by any one of three
methods, viz., the method of moments, the method of least squares
and the method of maximum likelihood (Section 2.5). However, MLS
is suggested in case outliers are detected.
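For the first stage, the sample statistics used by the method of moments can be sketched as below. The sketch uses plain moment ratios with divisor n; whether FACHVES applies bias corrections to the skewness and kurtosis is not stated in this section, so the exact formulas and the function name are assumptions.

```python
def sample_moments(data):
    """Return (mean, standard deviation, skewness, kurtosis) from central moments."""
    n = len(data)
    if n < 2:
        raise ValueError("at least two values are needed")
    mean = sum(data) / n
    m2 = sum((x - mean) ** 2 for x in data) / n
    m3 = sum((x - mean) ** 3 for x in data) / n
    m4 = sum((x - mean) ** 4 for x in data) / n
    if m2 == 0.0:
        raise ValueError("all values are equal; moment ratios are undefined")
    return mean, m2 ** 0.5, m3 / m2 ** 1.5, m4 / m2 ** 2


if __name__ == "__main__":
    # Illustrative values only, not data from the thesis.
    print(sample_moments([12.1, 15.3, 9.8, 30.5, 11.0, 13.7, 10.4, 14.2]))
```

For a normal distribution the method of moments estimates are simply the mean and the standard deviation; the skewness is what the expert system uses to narrow down the candidate distributions.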

4.3.2 Fitting a Distribution 

After outliers, if present, have been identified and the
statistics calculated, preliminary advice is taken from the expert
system, wherein all the suitable distributions are indicated.
These distributions are now fitted to the data using different
methods of parameter estimation. Each distribution is tested for
goodness of fit, viz., by the Chi square test or the
Kolmogorov-Smirnov test (not implemented). The distribution with
the least Chi square statistic is generally assumed to fit the
data best. Apart from this, the goodness of fit is also observed
visually by plotting the results. The given data and the fitted
distribution along with the confidence bands are plotted, and
then finally the most suitable probability distribution that fits
the data is decided upon.
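The selection rule in this paragraph can be sketched in a few lines. The Chi square statistic is the usual sum of (observed - expected)^2 / expected over the classes; the function names and the dictionary of results are hypothetical, and the critical value must be supplied by the caller for the appropriate degrees of freedom (the session in Table 5.2 prints 7.815 for 3 degrees of freedom).

```python
def chi_square_statistic(observed, expected):
    """Sum of (O - E)^2 / E over the classes."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


def best_fit(statistics_by_name, critical_value):
    """Name of the accepted distribution with the least statistic, or None."""
    accepted = {name: stat for name, stat in statistics_by_name.items()
                if stat < critical_value}
    if not accepted:
        return None
    return min(accepted, key=accepted.get)


if __name__ == "__main__":
    # Illustrative class counts: 20 values in 4 equally probable classes.
    stat = chi_square_statistic([6, 4, 5, 5], [5.0, 5.0, 5.0, 5.0])
    print(stat, best_fit({"candidate": stat}, critical_value=7.815))
```

A distribution whose statistic exceeds the critical value is rejected outright, so the "least statistic" rule is applied only among the distributions that pass the test.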

The use of FACHVES Ver 2.0 for fitting probability
distributions to data is demonstrated with a number of examples in
Chapter 5.

FREQUENCY ANALYSIS USING FACHVES Ver 2.0

5.1 Data used

The data used to test the developed program consist of
one set of seasonal data and two sets of nonseasonal data (Table
5.1).

A Nonseasonal data

i) Annual peak discharge for river Narmada (1951-1982),

ii) Monsoon flows for river Mahanadi at Hirakud (1946-1982)

B Seasonal data

Streamflow data for the Mahanadi Basin at the Hirakud dam
site are used for season by season analysis. The data are for the
period 1931 to 1980 for 6 periods comprising monthly data from
June to October and non-monsoon data.

5. 2 Results and Discussions 

The capabilities of the program can be tested for

a) Analysis of nonseasonal data and

b) Season by season analysis (for seasonal data)

5.2.1 Nonseasonal data

Analysis of two sets of data has been done and the
interactive session with the program for one of them is reported
completely in Table 5.2.

i) Set 1: The analysis is done for annual peak discharge in
river Narmada. The program is executed by typing the name of the
executable file BAD DU followed by RETURN. The program begins its
execution and the Input module comes first. It asks whether
the input will be given on the screen. Since the input is given in
an input file, 2 is entered. The format required for the input
file and other details are displayed on the screen. The grammar
for the input file, given also on the screen, is as follows: the
input file should be named TEST.INP, with the TITLE given in
80A1 format followed by the number of data values, number of
seasons, number of points (NCLASS + 1, where NCLASS is the number
of classes into which data can be divided for the Chi-square test)
and then the observational data. Since such a file is available,
1 is entered. The Master menu is now displayed. Initially the
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16421 0 

9278.0 

2550 0 

1673 0 

1406 0 

856.0 

8934.0 

4953.0 

1029.0 

8319 0 

777.0 

1559.0 

9975 0 

2387.0 

IBM 0 

7474 0 

1493.0 

1757.0 

1700 0 

1709.0 

699 0 

19385 0 

2046 0 

2379 0 


Table 5.2

Interactive Session 1

(Annual Peak Discharge in River Narmada)

******************************************
*  EXPERT SYSTEM FOR FREQUENCY ANALYSIS  *
*           OF HYDROLOGIC DATA           *
*              [illegible]               *
*              [illegible]               *
*         UNDER THE GUIDANCE OF          *
*            Dr S Ramaseshan             *
*                   at                   *
*              I I T Kanpur              *
******************************************
*        WISH YOU HAPPY COMPUTING        *
*                  WITH                  *
*            FACHVES Ver 2.0             *
******************************************
* FREQUENCY ANALYSIS WITH EXPERT ADVICE  *
******************************************
*              INPUT MODULE              *
*----------------------------------------*

Do you want to give input on the screen?
If yes, Type 1 ,If no, Type 2
If you want to quit Type 3
2
---The input file should be named as TEST.INP
It must have following
* TITLE in the format 80A1
* [illegible], unformatted
* NX--Number of data
* NS--Number of seasons
  (for annual series NS=1)
* NPT--Number of points
  NPT=NCLASS+1 where NCLASS is number of
  classes in which data can be divided
  for chi square test. The class frequency
  should be equal to 5
  (It can be 7/11/13)
* Observational hydrologic data RX(I)
  (Not more than 600)
  seasons (in row)/years (in column)

Have you got such an input file ? If yes, type 1
If you want to quit ,type 2
If you want to go to previous menu type 3
1

*----------------------------------------*
*              MASTER-MENU               *
*----------------------------------------*
* 1.PRELIMINARY ANALYSIS                 *
* 2.CHOICE OF DISTRIBUTIONS              *
* 3.TESTS FOR GOODNESS OF FIT            *
* 4.PLOTTING OF RESULTS                  *
* 5.QUIT                                 *
* 6.HELP FROM EXPERT SYSTEM INTERFACE    *
*----------------------------------------*

Type the serial number of desired option
1

******************************************
*       $$ PRELIMINARY ANALYSIS $$       *
*----------------------------------------*
* 1.STATISTICAL PARAMETERS               *
* 2.FREQUENCY PLOT                       *
* 3.C.D.F PLOT (FOR ALL DISTRIBUTIONS)   *
* 4.CONFIDENCE BANDS                     *
* 5.TRANSFORMATIONS                      *
* 6.HELP FROM EXPERT SYSTEM INTERFACE    *
* 7.TO MASTER MENU                       *
* 8.OUTLIER TESTS                        *
*----------------------------------------*

Type the serial number of desired option
1

MEAN= 25382.2852
VAR= 0.11461E+09
SKEW= 0.95990E+00
STDEV= 0.10706E+05
KURTOSIS= 0.40343E+01 AVG DEVIATION= 0.82410E+04

THIS ANALYSIS IS FOR LOG-TRANSFORMED DATA
MEAN= 10.0593
VAR= 0.17?21E+00
SKEW= 0.96092E-02
STDEV= 0.41498E+00
KURTOSIS= 0.26?40E+01
XMIN= 0.93171E+01 XMAX= 0.10898E+02

******************************************
*       $$ PRELIMINARY ANALYSIS $$       *
*----------------------------------------*
* 1.STATISTICAL PARAMETERS               *
* 2.FREQUENCY PLOT                       *
* 3.C.D.F PLOT (FOR ALL DISTRIBUTIONS)   *
* 4.CONFIDENCE BANDS                     *
* 5.TRANSFORMATIONS                      *
* 6.HELP FROM EXPERT SYSTEM INTERFACE    *
* 7.TO MASTER MENU                       *
* 8.OUTLIER TESTS                        *
*----------------------------------------*

Type the serial number of desired option
6

******************************************
*     $$ EXPERT SYSTEM INTERFACE $$      *
*----------------------------------------*
* 1.FOR PRELIMINARY ADVICE               *
* 2.FOR SECONDARY ADVICE                 *
* 3.FOR EXPERT ANALYSIS OF RESULTS       *
* 7.TO MASTER MENU                       *
*----------------------------------------*

Which help do you want?

Type the serial number of desired option
1

f-0 (initial-fact)
f-1 (start)
f-0 (initial-fact)
f-1 (start)
f-2 (mean 25382)
f-3 (skew 0.95990002)
f-4 (kurt 4.03429985)
f-5 (stdev 10706)
f-6 (stdev of logtran data 0.41497999)
f-7 (length of data 28)

W E L C O M E
TO
F A C H V E S

This expert system is designed for single
peaked continuous hydrologic variables only

IF YOU DO NOT UNDERSTAND ANY QUESTION ; TYPE nk
nk --- Not known

Type of data type the first letter (e/a/s)
Whether it is exceedence OR annual ?a
Do you think that outliers are present?
nk
In absence of information outliers are
initially assumed not to be present
Use method of moments for parameter estimation

Here is the list of steps for analysis **
1 **DO THE PRELIMINARY ANALYSIS
2 **TEST FOR OUTLIERS
3 **FIT A SUITABLE DISTRIBUTION
4 **TEST FOR GOODNESS OF FIT
5 **PLOT THE RESULTS

You can fit
1 a pearson type 3 distribution
2 a lognormal 3 parameter distribution
3 a logpearson type 3 distribution
If pearson type 3 is not suggested then
Gamma (2 parameters) may be fit
! You can fit either one OR all available distributions

** If more than one distribution satisfies the
goodness of fit test (chi square test) then
the best fit is the one with the least
Chi square value

PROCESS IS BEING [illegible]
RESET THE PROCESS IF YOU WANT TO CONTINUE
WITH OTHER SET OF DATA
Do you want any help ? no

Suggestions passed should be taken in sequencial order

f-0 (initial-fact)
f-1 (start)
f-2 (mean 25382)
f-3 (skew 0.95990002)
f-4 (kurt 4.03429985)
f-5 (stdev 10706)
f-6 (stdev of logtran data 0.41497999)
f-7 (length of data 28)
f-8 (type of data a)
f-9 (process continue yes)
f-10 (data is large enough)
f-11 (outliers are nk)
f-12 (outliers no)
f-13 (positive skew)
f-14 (logtran val is 2.49346423)
f-15 (suggestion 15)
f-16 (esad PT3,LP3,LN3)
f-17 (suggestion 10)
f-18 (esad GD)
f-19 (stop)
f-20 (help no)

Excising rule: rule1
Excising rule: rule2
Excising rule: rule3
Excising rule: rule4
Excising rule: rule5
Excising rule: rule6
Excising rule: rule7
Excising rule: rule8
Excising rule: rule9
Excising rule: rule10
Excising rule: rule11
Excising rule: rule12
Excising rule: rule13
Excising rule: rule14
Excising rule: rule15
Excising rule: rule16
Excising rule: rule18
Excising rule: rule19
Excising rule: rule21
Excising rule: rule22
Excising rule: rule23
Excising rule: rule24
Excising rule: rule25
Excising rule: rule26
Now you are out of clips
Value passed = 15.00000
Name passed is =
PT3,LP3,LN3


*----------------------------------------*
*              MASTER-MENU               *
*----------------------------------------*
* 1.PRELIMINARY ANALYSIS                 *
* 2.CHOICE OF DISTRIBUTIONS              *
* 3.TESTS FOR GOODNESS OF FIT            *
* 4.PLOTTING OF RESULTS                  *
* 5.QUIT                                 *
* 6.HELP FROM EXPERT SYSTEM INTERFACE    *
*----------------------------------------*

VALUE PASSED IS 15.00000
NAME PASSED IS
PT3,LP3,LN3

Type the serial number of desired option
1

******************************************
*       $$ PRELIMINARY ANALYSIS $$       *
*----------------------------------------*
* 1.STATISTICAL PARAMETERS               *
* 2.FREQUENCY PLOT                       *
* 3.C.D.F PLOT (FOR ALL DISTRIBUTIONS)   *
* 4.CONFIDENCE BANDS                     *
* 5.TRANSFORMATIONS                      *
* 6.HELP FROM EXPERT SYSTEM INTERFACE    *
* 7.TO MASTER MENU                       *
* 8.OUTLIER TESTS                        *
*----------------------------------------*

Type the serial number of desired option
8

MEAN= 25382.2852
VAR= 0.11461E+09
SKEW= 0.95990E+00
STDEV= 0.10706E+05
KURTOSIS= 0.40343E+01 AVG DEVIATION= 0.82410E+04

THE DATA IS BEING CHECKED FOR OUTLIERS

FOR WHICH DISTRIBUTION DO YOU WANT IT CHECKED i.e.
WHETHER 1.NORMAL , 2.LOG-NORMAL(2 PARAMETERS)
3.LOG-NORMAL(3 PARAMETERS) , 4.EV-I
5.EV-III, 6.PT-III, 7.LPT-III

6

MEAN= 25382.2852
VAR= 0.11461E+09
SKEW= 0.95990E+00
STDEV= 0.10706E+05
KURTOSIS= 0.40343E+01 AVG DEVIATION= 0.82410E+04

THE PARAMETERS FOR THE ORIGINAL DATA ARE :
MEAN= 25382.2852
SKEW= 0.95990E+00
STDEV= 0.10706E+05

MEAN= 0.4441
VAR= 0.31540E+00
SKEW= 0.1?0?0E+01
STDEV= 0.56160E+00
KURTOSIS= 0.58336E+01 AVG DEVIATION= 0.43109E+00
*** FOR ORIGINAL DATA ***
CHECK FOR OUTLIERS BY WRC METHOD

THE PARAMETERS FOR THE LOG-TRANSFORMED DATA ARE :
MEAN= 0.0000
SKEW= 0.00000E+00
STDEV= 0.00000E+00

XMIN= 0.6???6E-01 XMAX= 0.21432E+01
FOR NORMAL DISTRIBUTION
THE THRESHOLD VALUE FOR HIGHER OUTLIERS : 1.867218
THE THRESHOLD VALUE FOR LOWER OUTLIERS : -0.8789062
XMIN= 0.6???6E-01 XMAX= 0.21432E+01
NO OF HIGH OUTLIERS = 1
NO OF LOW OUTLIERS= 0

THE NEW PARAMETERS OBTAINED DUE TO THE
PRESENCE OF OUTLIERS ARE :
MEAN= 24320.0371
VAR= 0.86??3E+08
SKEW= 0.68045E+00
STDEV= 0.93051E+04
KURTOSIS= 0.3459?E+01 AVG DEVIATION= 0.73418E+04
******************************************
*       $$ PRELIMINARY ANALYSIS $$       *
*----------------------------------------*
* 1.STATISTICAL PARAMETERS               *
* 2.FREQUENCY PLOT                       *
* 3.C.D.F PLOT (FOR ALL DISTRIBUTIONS)   *
* 4.CONFIDENCE BANDS                     *
* 5.TRANSFORMATIONS                      *
* 6.HELP FROM EXPERT SYSTEM INTERFACE    *
* 7.TO MASTER MENU                       *
* 8.OUTLIER TESTS                        *
*----------------------------------------*

Type the serial number of desired option
6

******************************************
*     $$ EXPERT SYSTEM INTERFACE $$      *
*----------------------------------------*
* 1.FOR PRELIMINARY ADVICE               *
* 2.FOR SECONDARY ADVICE                 *
* 3.FOR EXPERT ANALYSIS OF RESULTS       *
* 7.TO MASTER MENU                       *
*----------------------------------------*

Which help do you want?

Type the serial number of desired option

f-0 (initial-fact)
f-1 (start)
f-0 (initial-fact)
f-1 (start)
f-2 (mean 25382)
f-3 (skew 0.95990002)
f-4 (kurt 4.03429985)
f-5 (stdev 10706)
f-6 (stdev of logtran data 0.41497999)
f-7 (length of data 28)

W E L C O M E
TO
F A C H V E S

This expert system is designed for single
peaked continuous hydrologic variables only

IF YOU DO NOT UNDERSTAND ANY QUESTION ; TYPE nk
nk --- Not known

Type of data type the first letter (e/a/s)
Whether it is exceedence OR annual ?a
Do you think that outliers are present? yes
You should use method of least squares for
parameter estimation

You can fit
1 a pearson type 3 distribution
2 a lognormal 3 parameter distribution
3 a logpearson type 3 distribution
If pearson type 3 is not suggested then
Gamma (2 parameters) may be fit
! You can fit either one OR all available distributions

** If more than one distribution satisfies the
goodness of fit test (chi square test) then
the best fit is the one with the least
Chi square value

PROCESS IS BEING [illegible]
RESET THE PROCESS IF YOU WANT TO CONTINUE
WITH OTHER SET OF DATA
Do you want any help ? no

Suggestions passed should be taken in sequencial order


f-0 (initial-fact)
f-1 (start)
f-2 (mean 25382)
f-3 (skew 0.95990002)
f-4 (kurt 4.03429985)
f-5 (stdev 10706)
f-6 (stdev of logtran data 0.41497999)
f-7 (length of data 28)
f-8 (type of data a)
f-9 (process continue yes)
f-10 (data is large enough)
f-11 (outliers are yes)
f-12 (outliers [illegible])
f-13 (outliers present)
f-14 (positive skew)
f-15 (logtran val is 2.49346423)
f-16 (suggestion 15)
f-17 (esad PT3,LP3,LN3)
f-18 (suggestion 10)
f-19 (esad GD)
f-20 (stop)
f-21 (help no)

Excising rule: rule1
Excising rule: rule2
Excising rule: rule3
Excising rule: rule4
Excising rule: rule5
Excising rule: rule6
Excising rule: rule7
Excising rule: rule8
Excising rule: rule9
Excising rule: rule10
Excising rule: rule11
Excising rule: rule12
Excising rule: rule13
Excising rule: rule14
Excising rule: rule15
Excising rule: rule16
Excising rule: rule18
Excising rule: rule19
Excising rule: rule20
Excising rule: rule21
Excising rule: rule22
Excising rule: rule23
Excising rule: rule24
Excising rule: rule25
Excising rule: rule26
Now you are out of clips
Value passed = 15.00000
Name passed is =
PT3,LP3,LN3


*----------------------------------------*
*              MASTER-MENU               *
*----------------------------------------*
* 1.PRELIMINARY ANALYSIS                 *
* 2.CHOICE OF DISTRIBUTIONS              *
* 3.TESTS FOR GOODNESS OF FIT            *
* 4.PLOTTING OF RESULTS                  *
* 5.QUIT                                 *
* 6.HELP FROM EXPERT SYSTEM INTERFACE    *
*----------------------------------------*

VALUE PASSED IS 15.00000
NAME PASSED IS
PT3,LP3,LN3

Type the serial number of desired option
2

******************************************
*        $$ METHOD OF ANALYSIS $$        *
*----------------------------------------*
* 1.METHOD OF MOMENTS                    *
* 2.METHOD OF MAXIMUM LIKELYHOOD         *
* 3.METHOD OF LEAST-SQUARES              *
* 4.TO MASTER MENU                       *
*----------------------------------------*

Which method do you like?

Type the serial number of desired option
3

It is only available for Normal Distribution
For other ones the data is transformed into normal variables and then MLS is used

******************************************
*      $$ CHOICE OF DISTRIBUTION $$      *
*----------------------------------------*
* 1.NORMAL                               *
* 2.LOG NORMAL (2 PARAMETER)             *
* 3.LOG NORMAL (3 PARAMETER)             *
* 4.PEARSON TYPE 3                       *
* 5.LOG PEARSON TYPE 3                   *
* 6.EXTREME VALUE TYPE-1                 *
* 7.EXTREME VALUE TYPE-3                 *
* 8.FAP- PROGRAMS                        *
* 9.TO MASTER MENU                       *
* 10.TRANSFORMATIONS                     *
* 11.METHOD OF ANALYSIS                  *
*----------------------------------------*

If you want to transform the data ,TYPE- 10
If not , then which distribution? type the serial number

PEARSON AND LOG PEARSON DISTRIBUTION
*************************************

 I      X(I)    LX(I)   P(I)   KTP(I)  KTLP(I)

 2   47051.00   4.67    0.06    1.76    1.58
 3   41691.00   4.62    0.09    1.??    1.??
 4   36561.00   4.56    0.13    1.1?    1.13
 5   33915.00   4.53    0.16    0.93    0.90
 6   33?70.00   4.52    0.20    0.76    0.84
 7   31601.00   4.50    0.23    0.6?    0.72
 8   31601.00   4.50    0.27    0.49    0.61
 9   29564.00   4.47    0.31    0.37    0.51
10   27935.00   4.45    0.34    0.26    0.41
11   26232.00   4.42    0.38    0.16    0.31
12   25667.00   4.41    0.41    0.06    0.22
13   25023.00   4.40    0.45   -0.03    0.13
14   2??54.00   4.39    0.48   -0.11    0.04
15   23430.00   4.37    0.52   -0.20   -0.05
16   2?75?.00   4.36    0.55   -0.28   -0.13
17   20746.00   4.32    0.59   -0.36   -0.22
18   19690.00   4.29    0.62   -0.44   -0.3?
19   19???.00   4.29    0.66   -0.52   -0.41
20   18?91.00   4.27    0.69   -0.60   -0.51
21   18101.00   4.26    0.73   -0.68   -0.61
22   17713.00   4.25    0.77   -0.76   -0.72
23   1660?.00   4.22    0.80   -0.85   -0.85
24   16135.00   4.21    0.84   -0.91   -0.90
25   13631.00   4.13    0.87   -1.04   -1.13
26   11982.00   4.08    0.91   -1.16   -1.32
27   11390.00   4.05    0.94   -1.30   -1.57
28   11127.00   4.05    0.98   -1.50   -2.01


THE CHI SQUARE STATISTIC OBTAINED DUE TO THE
PRESENCE OF OUTLIERS USING MLS IS :

CHI SQUARE STATISTIC
*********************
1.307353

NO OF DEGREES OF FREEDOM
**************************
3

THE THEORITICAL CHI-SQUARE VALUE IS : 7.815000
FOR 3 DEGREES OF FREEDOM

THE FIT IS GOOD

PEARSON TYPE 3 DISTRIBUTION
***************************

METHOD OF LEAST SQUARES

ALPHA = 1450.36?
BETA = 4.311136
GAMMA = 1374.070

MEAN = 24320.04
STDEV = 9305.079
SKEW = 0.9598955

******************************************
*     $$ TEST OF GOODNESS OF FIT $$      *
*----------------------------------------*
* 1.CHI-SQUARE TEST                      *
* 2.KOLMOGOROV SMIRNOV TEST              *
* 3.TO PLOTTING RESULTS                  *
* 4.COMPARISON OF STANDARD ERROR         *
*   OF ESTIMATE FOR DIFFERENT            *
*   DISTRIBUTIONS                        *
* 5.TO MASTER MENU                       *
* 6.HELP FROM EXPERT-SYSTEM INTERFACE    *
*----------------------------------------*

Type the serial number of desired option
1

THE NEW PARAMETERS OBTAINED DUE TO THE
PRESENCE OF OUTLIERS ARE
MEAN= 24320.0371
VAR= 0.86??3E+08
SKEW= 0.68045E+00
STDEV= 0.93051E+04
KURTOSIS= 0.3459?E+01 AVG DEVIATION= 0.73418E+04

THE CHI SQUARE STATISTIC OBTAINED DUE TO
THE PRESENCE OF OUTLIERS USING MLS IS

CHI SQUARE STATISTIC
********************
1.307353

NO OF DEGREES OF FREEDOM
**************************
3

THE THEORITICAL CHI-SQUARE VALUE IS : 7.815000
FOR 3 DEGREES OF FREEDOM

THE FIT IS GOOD

THE CHI SQUARE STATISTIC OBTAINED WITHOUT
THE PRESENCE OF OUTLIERS USING MLS IS

CHI-SQUARE STATISTIC
*********************
2.109704

NO OF DEGREES OF FREEDOM
**************************
3

THE THEORITICAL CHI-SQUARE VALUE IS 7.815000
FOR 3 DEGREES OF FREEDOM

THE FIT IS GOOD

******************************************
*         $$ PLOTTING RESULTS $$         *
*----------------------------------------*
* 1.FREQUENCY PLOT                       *
* 2.C.D.F PLOT                           *
* 3.CONFIDENCE BANDS                     *
* 4.TO MASTER MENU                       *
*----------------------------------------*

Type the serial number of desired option
4

*----------------------------------------*
*              MASTER-MENU               *
*----------------------------------------*
* 1.PRELIMINARY ANALYSIS                 *
* 2.CHOICE OF DISTRIBUTIONS              *
* 3.TESTS FOR GOODNESS OF FIT            *
* 4.PLOTTING OF RESULTS                  *
* 5.QUIT                                 *
* 6.HELP FROM EXPERT SYSTEM INTERFACE    *
*----------------------------------------*

VALUE PASSED IS 15.00000
NAME PASSED IS
PT3,LP3,LN3

Type the serial number of desired option
1

******************************************
*       $$ PRELIMINARY ANALYSIS $$       *
*----------------------------------------*
* 1.STATISTICAL PARAMETERS               *
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areliminary analysis is done and hence 1 is entered. The details 
of the Submenu of preliminary analysis are displyed and 
Statistical Parameters option is now chosen The statistical 
parameters, viz , the mean, variance, skew, standard deviation, 
^urtosis and average deviation for the original data as well as 
log transformed data are calculated The statistical analysis show 
that the data are positively skewed (0 9599), coefficient of 
kurtosls Is 4 0343 and the coefficient of skew of the log 
transformed data Is nearly zero (0 0096), the mean and standard 
deviation of the original data and log transformed data are 
25382 2852, 0 10706E05 and 10 0593; 0 41498 respectively Mow, the 
preliminary advice is taken from the Expert system interface hy 
entering 6 and then 1 The expert system asks for the type of data 
and whether outliers are present Depending on whether the data 
are exceedence, annual or seasonal, the letter e, a or s is typed 
Here the data being annual, a is entered Initially, since it Is 
not known whether the outliers are present or not, nk is entered 
The expert system suggests that in the absence of information, 
outliers are initially assumed not to be present It suggests 
method of moments for parameter estimation and gives the list of 
steps for analysis, viz , preliminary analysis, test foi outliers, 
fit a suitable distribution, test for goodness of fit and pLot the 
results , 
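
The statistical parameters listed above can be illustrated with a short sketch. It is not the FACHVES code: the function names are hypothetical, the variance is assumed to use the n-1 divisor, the skew and kurtosis are assumed to be simple moment ratios, and the log transformation is assumed to be the natural logarithm. The program may use different estimators.

```python
import math


def sample_stats(values):
    """Mean, variance, standard deviation, skew, kurtosis and average deviation."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    variance = sum(d * d for d in dev) / (n - 1)   # assumed n-1 divisor
    m2 = sum(d * d for d in dev) / n               # central moments about the mean
    m3 = sum(d ** 3 for d in dev) / n
    m4 = sum(d ** 4 for d in dev) / n
    return {
        "mean": mean,
        "variance": variance,
        "stdev": math.sqrt(variance),
        "skew": m3 / m2 ** 1.5,                    # assumed moment-ratio skew
        "kurtosis": m4 / m2 ** 2,                  # assumed moment-ratio kurtosis
        "avg_deviation": sum(abs(d) for d in dev) / n,
    }


def log_stats(values):
    """The same statistics for the log transformed data (natural log assumed)."""
    return sample_stats([math.log(v) for v in values])
```

A skew near zero for the log transformed data, as reported for the Narmada series, is the kind of evidence that points towards a logarithmic distribution.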

The KBES suggests all the possible distributions that
can be fitted to the data. For the data of Narmada river, the KBES
suggests a distribution which can be a Pearson Type 3, 3 parameter
Log Normal or Log Pearson Type 3 distribution.

The analysis is done in the same sequence. Initially the
outlier tests are done for the Pearson Type 3 distribution, which
indicate the presence of an upper outlier. The Expert system
interface is again approached and the secondary advice is taken.
Here, when the question 'Do you think outliers are present?' is
asked, yes is entered. The expert system suggests the method of
least squares for parameter estimation. Now, the choice of
distribution option is chosen by entering 2 and the method of least
squares is chosen as the method of analysis by entering 3. The
distribution fitted is the Pearson Type 3, which is chosen by
entering 4. In the method of least squares, the value of the
outlier is neglected while the plotting position value is retained,
and then the regression is done between the variable and the
frequency factor. Here, the rank (I), variable X(I), logarithm of
the variable LX(I), the plotting position P(I), and the frequency
factors for the variable and the logarithm of the variable (KTP(I)
and KTLP(I) respectively) are displayed on the screen. The Chi
square statistic obtained with the deletion of outlier values using
MLS is 1.3074 as against the theoretical value of 7.815 for 3
degrees of freedom, and the fit is good. The parameters for the
Pearson Type 3 distribution are displayed on the screen as follows:

ALPHA = 4456.367    MEAN  = 24320.04
BETA  = 4.3412      STDEV = 9285.079
GAMMA = 4974.078    SKEW  = 0.9599

The coefficient of skew of the original data is used to calculate
the parameters of the Pearson Type 3 distribution. The optimal
value of the coefficient of skew can be obtained by conducting a
linear search by the Golden Search method: take a value of skew,
calculate KT, carry out a regression between X and KT, and then
calculate the sum of the squared errors; this is repeated until
the sum of the squared errors is a minimum. This has not been
implemented in the study.
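
The regression between X and the frequency factor, and the Golden Search over the coefficient of skew that the study leaves unimplemented, can be sketched as follows. This is an illustration only: all function names are hypothetical, the frequency factor uses Kite's polynomial approximation for the Pearson Type 3 distribution, and the exceedance probabilities are whatever plotting positions the analyst chooses (for example I/(N+1) for data sorted in descending order).

```python
import math
from statistics import NormalDist


def pt3_frequency_factor(exceed_prob, skew):
    """Approximate Pearson Type 3 frequency factor KT (Kite's polynomial)."""
    z = NormalDist().inv_cdf(1.0 - exceed_prob)   # standard normal deviate
    k = skew / 6.0
    return (z + (z * z - 1.0) * k + (z ** 3 - 6.0 * z) * k * k / 3.0
            - (z * z - 1.0) * k ** 3 + z * k ** 4 + k ** 5 / 3.0)


def regress(kt, x):
    """Least squares line x = a + b*kt; returns (a, b, sum of squared errors)."""
    n = len(x)
    kbar = sum(kt) / n
    xbar = sum(x) / n
    sxx = sum((k - kbar) ** 2 for k in kt)
    sxy = sum((k - kbar) * (v - xbar) for k, v in zip(kt, x))
    b = sxy / sxx
    a = xbar - b * kbar
    sse = sum((v - (a + b * k)) ** 2 for k, v in zip(kt, x))
    return a, b, sse


def sse_for_skew(skew, x, probs):
    """Sum of squared errors of the X versus KT regression for a trial skew."""
    kt = [pt3_frequency_factor(p, skew) for p in probs]
    return regress(kt, x)[2]


def golden_section_min(f, lo, hi, tol=1e-6):
    """Golden section search for the minimum of a unimodal f on [lo, hi]."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    c = hi - inv_phi * (hi - lo)
    d = lo + inv_phi * (hi - lo)
    fc, fd = f(c), f(d)
    while hi - lo > tol:
        if fc < fd:                       # minimum lies in [lo, d]
            hi, d, fd = d, c, fc
            c = hi - inv_phi * (hi - lo)
            fc = f(c)
        else:                             # minimum lies in [c, hi]
            lo, c, fc = c, d, fd
            d = lo + inv_phi * (hi - lo)
            fd = f(d)
    return (lo + hi) / 2.0
```

With the outlier values removed from x but their plotting positions kept out of probs, a call such as golden_section_min(lambda s: sse_for_skew(s, x, probs), 0.0, 2.0) would return the trial skew with the least sum of squared errors, provided that sum is unimodal over the chosen interval. The intercept and slope of the final regression then play the role of the mean and standard deviation.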

Next the Chi square test is chosen. This option
calculates the Chi square value obtained with the deletion of
outlier values from the distribution using method of moments
parameter estimates (1.3074) and the Chi square value obtained
for the original data set using method of moments parameter
estimates (2.1097), as against a theoretical value of 7.815 for 3
degrees of freedom.
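
The comparison of the observed and theoretical Chi square values can be sketched as below. The sketch assumes six classes of equal probability under the fitted distribution (so that a two parameter distribution leaves 6 - 2 - 1 = 3 degrees of freedom) and uses a Normal distribution only as an example; the class construction inside FACHVES may differ, and the function names are hypothetical. The critical value 7.815 is the one quoted in the text.

```python
from statistics import NormalDist

CHI2_CRIT_95_3DOF = 7.815   # theoretical value quoted in the text


def chi_square_statistic(data, dist, n_classes=6):
    """Chi square statistic with equal-probability classes under `dist`.

    `dist` is any object with a cdf(x) method, e.g. statistics.NormalDist.
    """
    n = len(data)
    expected = n / n_classes
    observed = [0] * n_classes
    for x in data:
        # class index from the fitted cumulative probability
        k = min(int(dist.cdf(x) * n_classes), n_classes - 1)
        observed[k] += 1
    chi2 = sum((o - expected) ** 2 / expected for o in observed)
    return chi2, observed


def fit_is_good(chi2, critical=CHI2_CRIT_95_3DOF):
    """The fit is accepted when the statistic is below the critical value."""
    return chi2 < critical
```

For example, fit_is_good(1.3074) and fit_is_good(2.1097) both hold, which is the conclusion reported for the two method of moments estimates.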

The above procedure is repeated for the 3 parameter Log
Normal distribution. The outlier tests indicate the absence of any
outlier and, even though the MLS option is available, the method of
moments is adopted as the method of analysis. The Chi square
statistic obtained is 5.666 as against a theoretical value of
7.815 for 3 degrees of freedom. The fit is hence good at the 95%
confidence limit.

The analysis for Log Pearson Type 3 indicates the
presence of an upper outlier. So, MLS is used as the method of
analysis. The Chi square value obtained by MLS deleting the
outlier value is 0.3843 as against a theoretical value of 7.815
for 3 degrees of freedom. The parameters obtained for Log Pearson
Type 3 are displayed on the screen as follows:

ALPHA = 8.111E-04   MEAN  = 4.3552
BETA  = 43229.07    STDEV = 0.1686
GAMMA = -30.7081    SKEW  = 9.6193E-03

Similarly the Chi square values obtained using the method of moments
with the deletion of the outlier value from the data set and for the
original data set are 4.5381 and 4.7777 respectively, as against
the theoretical value of 7.815 for 3 degrees of freedom at the 95%
confidence limit. Since the Chi square value obtained by MLS, which
takes the outliers into consideration, is the least, these
parameters are taken to draw the confidence bands. Now, on
entering the number 3 for confidence bands, the fitted
distribution and the sample data, each indicated by its own
plotting symbol, and the confidence limits, indicated by '/', are
drawn (Fig 1, Table 5.2).

Since the Chi square value obtained by the deletion of
the outlier value from the data set using the method of least
squares for parameter estimation is the least for the Log Pearson
Type 3 distribution, this distribution is taken as the best fit for
the data of river Narmada among the distributions tested for the
methods of parameter estimation used. The histogram for the data is
given in Fig 5.1A and the cdf plot is given in Fig 5.1B. The
summary of the analysis is given in Table 5.3.
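
The selection rule used here (among the fits that pass the Chi square test, take the one with the least Chi square value) can be written as a small sketch. The dictionary holds the values quoted above for the Narmada data; the function name and data layout are hypothetical and are not part of the package.

```python
CHI2_CRIT_95_3DOF = 7.815   # theoretical value quoted in the text

# (distribution, method of analysis) -> Chi square value quoted in the text
narmada_chi2 = {
    ("PT3", "MLS"): 1.3074,
    ("LN3", "MM"): 5.666,
    ("LPT3", "MLS"): 0.3843,
}


def best_fit(chi2_by_fit, critical=CHI2_CRIT_95_3DOF):
    """Return the passing fit with the least Chi square value, or None."""
    passing = {fit: v for fit, v in chi2_by_fit.items() if v < critical}
    if not passing:
        return None
    return min(passing, key=passing.get)


print(best_fit(narmada_chi2))
```

Applied to the three values above, the rule picks Log Pearson Type 3 fitted by MLS, in agreement with the choice made in the text.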

ii) Set 2: The analysis for the monsoon flows for Hirakud
shows that the data are positively skewed (1.1925) with a
coefficient of kurtosis of 6.8911. The KBES suggests a
distribution which is either a Pearson Type 3 or Log Pearson Type
3 or 3 parameter Log Normal distribution.

The Pearson Type 3 distribution has an upper outlier and
no lower outliers. The Chi square statistic obtained by taking the
outlier into consideration and using the method of least squares as
the method of analysis is 6.8949. The Chi square values obtained
by using the method of moments with the outlier and without the
outlier are 6.89 and 11.49 respectively, as against a theoretical
value of 7.815 for 3 degrees of freedom. Except for the Chi square
value obtained for the original data using MM, which is high, the
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FIG. 5.1A HISTOGRAM FOR FLOOD FEAICS IN RIVER NARMADA 


FIG. 5.1B CDF PLOT FOR FLOOD PEAKS IN RIVER NARMADA
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Table 5 3 


Analysis for river Narmada

Theoretical Chi square value χ² (95%, 3 DOF) = 7.815

Columns: Stat. parameters (kurt, skew) | Dist. suggested |
Outliers if any (high, low) | Outliers considered (High, Low) |
Method of analysis | Dist. fit | χ² value
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0 959 
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MM 
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0 

1 
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MLS 
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MM 
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0 

MM 



H 


Distribution Fit     : Log Pearson Type 3
Method of analysis   : Method of least squares
Estimated Parameters : ALPHA = 8.11E-04
                       BETA  = 43229.07
                       GAMMA = -30.7081


other two Chi square values, obtained using MM (deleting the
outlier value) and MLS (deleting the outlier value), indicate that
the Pearson Type 3 distribution is a good fit at the 95% confidence
level.

The analysis for the 3 parameter Log Normal distribution
does not indicate any outliers. The parameters are estimated using
the equations given by Kite (1977). A subroutine to estimate the
parameters is also available, which has been incorporated in the
study. The Chi square value is 5.83.

The analysis for Log Pearson Type 3 indicates the presence of an
upper outlier. The Chi square value obtained using the method of
least squares taking the outliers into consideration is 8.945. The
Chi square values obtained using the method of moments with the
outliers taken into consideration and in the absence of outliers
are 6.89 and 11.49 respectively.

The 3 parameter Log Normal distribution is the best fit
with a Chi square value of 5.83; χ² (95%, 3 DOF) = 7.815. The
parameters obtained for the 3 parameter Log Normal distribution are
given in Table 5.4. The T year recurrence interval estimates are
also given in Fig 5.4. The histogram for the data and the cdf
plot are given in Fig 5.2A and Fig 5.2B. The results are shown in
Fig 5.3.

5.2.2 Seasonal data - The analysis for all the seasons for the
streamflow data for Mahanadi basin is done, and the analysis for
the 3rd season, i.e., for the month of August, is reported
completely in Table 5.5. The histogram and cdf plot are shown in
Figs 5.4A and 5.4B. The fitted distribution is shown in Table
5.6. The T year recurrence events are also shown in Table 5.6.
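
The T year events of Table 5.6 follow the frequency factor form X(T) = MEAN + KT * STDEV. The sketch below reproduces the tabulated values from the mean and standard deviation printed there. The two-decimal Normal frequency factors are an assumption, chosen because they match the tabulated events; the function name is hypothetical and this is not the package's own routine.

```python
# Normal frequency factors KT, rounded to two decimals (assumed values that
# reproduce the T year events listed in Table 5.6).
K_T_NORMAL = {2: 0.00, 5: 0.84, 10: 1.28, 20: 1.65, 50: 2.06, 100: 2.33}


def t_year_event(mean, stdev, t_years):
    """T year event X(T) = mean + KT * stdev for the Normal distribution."""
    return mean + K_T_NORMAL[t_years] * stdev


mean, stdev = 15156.839844, 4267.363281   # values printed in Table 5.6
events = {t: t_year_event(mean, stdev, t) for t in K_T_NORMAL}
for t in sorted(events):
    print(t, round(events[t], 3))
```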

The analysis for the month of August (3rd season) shows
that the data are negatively skewed (-0.0934) and the coefficient
of kurtosis is 2.6669. The KBES suggests the Normal distribution
for the data. Outlier analysis indicates the absence of any
outliers for the Normal distribution. The KBES suggests the method
of moments as the method of analysis. Using the method of moments,
the Normal distribution is fitted. The observed Chi square
statistic is 1.163; χ² (95%, 3 DOF) = 7.815. The confidence bands
plot shows that the Normal distribution is the best fit.
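
The outlier check reported for this season in Table 5.5 can be illustrated by a threshold test of the form MEAN ± K * STDEV. The factor K = 2.768 used below is an assumption: it is the value that reproduces the thresholds printed in the session (26968.90 and 3344.770) for the 50 values analysed. The function names are hypothetical.

```python
K_N_50 = 2.768   # assumed outlier factor for 50 values; matches the printed thresholds


def outlier_thresholds(mean, stdev, k_n=K_N_50):
    """Return (high threshold, low threshold) = mean +/- k_n * stdev."""
    return mean + k_n * stdev, mean - k_n * stdev


def find_outliers(values, mean, stdev, k_n=K_N_50):
    """Split out the values lying above the high or below the low threshold."""
    high, low = outlier_thresholds(mean, stdev, k_n)
    return [v for v in values if v > high], [v for v in values if v < low]


high, low = outlier_thresholds(15156.8398, 4267.363281)
print(round(high, 2), round(low, 2))
```

Since the smallest and largest observations (6362.0 and 24313.0) lie inside these thresholds, no high or low outliers are flagged, as in the session output.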

The analysis for each season, i.e., the distribution
suggested and the distribution fitted, is given in Table 5.7.


TABLE 5.4

FITTED DISTRIBUTION FOR MONSOON FLOWS FOR RIVER MAHANADI AT HIRAKUD

THREE PARAMETER LOGNORMAL DISTRIBUTION
METHOD OF MOMENTS

MEAN OF X      =  0.36383E+04
VARIANCE OF X  =  0.21487E+07
SKEW OF X      =  0.11436E+01
A              = -0.37778E+03

T, YEARS            2             5            10
X(T)         0.33948E+04   0.47026E+04   0.55580E+04
S(T)         0.27421E+03   0.35519E+03   0.47614E+03

T, YEARS           20            50           100
X(T)         0.63718E+04   0.74219E+04   0.82113E+04
S(T)         0.70147E+03   0.11334E+04   0.15421E+04
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Table 5 5 


Interactive Session II
(Seasonal Data for the Month of August for Mahanadi Basin)

*************************************************
*  EXPERT SYSTEM FOR FREQUENCY ANALYSIS        **
*  OF HYDROLOGIC DATA                          **
*  BY O. P. MISHRA                             **
*  IMPROVED BY K. R. C. RAO                    **
*  UNDER THE GUIDANCE OF                       **
*  Dr. S. Ramaseshan                           **
*  at I.I.T. Kanpur                            **
*************************************************
*  WISH YOU HAPPY COMPUTING                    **
*  WITH                                        **
*  FACHVES Ver 2.0                             **
*************************************************
* *--* FREQUENCY-ANALYSIS WITH EXPERT ADVICE *--*
*                                               *
******************************************
*              INPUT MODULE              *
*                                        *

Do you want to give input on the screen ?
If yes, Type 1 ; If no, Type 2
If you want to quit Type 3

2

-- The input file should be named as TEST.INP
It must have following :

* A TITLE with format 80A1
AND, unformatted
* NX -- Number of data
* NS -- Number of seasons
        (for annual series NS = 1)
* NPT -- Number of points
        NPT=NCLASS+1 where NCLASS is number of
        classes in which data can be divided
        for chisquare test. The class frequency
        should be equal to 5
        (It can be 7/11/13)
* Observational hydrologic data RX(I)
        (Not more than 600)
        seasons (in row)/years (in column)

Have you got such an input file ? If yes, type 1
If you want to quit, type 2
If you want to go to previous menu type 3

1

For which season do you want to do the analysis ?

Type the serial no of the season
3

THIS COMPUTATION IS FOR SEASON NO 3

*----------------------------------------*
*              MASTER-MENU               *
*                                        *
* 1. PRELIMINARY ANALYSIS                *
* 2. CHOICE OF DISTRIBUTIONS             *
* 3. TESTS FOR GOODNESS OF FIT           *
* 4. PLOTTING OF RESULTS                 *
* 5. QUIT                                *
* 6. HELP FROM EXPERT SYSTEM INTERFACE   *
* 7. ANALYSIS FOR OTHER SEASONS          *
*                                        *

Type the serial number of desired option
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AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 


*      $$$ PRELIMINARY ANALYSIS $$$      *
*----------------------------------------*
* 1. STATISTICAL PARAMETERS              *
* 2. FREQUENCY PLOT                      *
* 3. C.D.F. PLOT (FOR ALL DISTRIBUTIONS) *
* 4. CONFIDENCE BANDS                    *
* 5. TRANSFORMATIONS                     *
* 6. HELP FROM EXPERT SYSTEM INTERFACE   *
* 7. TO MASTER MENU                      *
* 8. OUTLIER TESTS                       *
*                                        *

Type the serial number of desired option
1

MEAN= 15156.8398
VAR = 0.18210E+08
SKEW= -0.93395E-01
STDEV= 0.42674E+04
KURTOSIS= 0.26669E+01   AVG. DEVIATION= 0.34710E+04

THIS ANALYSIS IS FOR LOG-TRANSFORMED DATA
MEAN= 9.5831
VAR = 0.97444E-01
SKEW= -0.78038E+00
STDEV= 0.31216E+00
KURTOSIS= 0.33245E+01

XMIN= 0.87581E+01   XMAX= 0.10099E+02

******************************************
*      $$$ PRELIMINARY ANALYSIS $$$      *
*                                        *
* 1. STATISTICAL PARAMETERS              *
* 2. FREQUENCY PLOT                      *
* 3. C.D.F. PLOT (FOR ALL DISTRIBUTIONS) *
* 4. CONFIDENCE BANDS                    *
* 5. TRANSFORMATIONS                     *
* 6. HELP FROM EXPERT SYSTEM INTERFACE   *
* 7. TO MASTER MENU                      *
* 8. OUTLIER TESTS                       *
*                                        *

Type the serial number of desired option

6

******************************************
*     $$ EXPERT SYSTEM INTERFACE $$      *
*                                        *
* 1. FOR PRELIMINARY ADVICE              *
* 2. FOR SECONDARY ADVICE                *
* 3. FOR EXPERT ANALYSIS OF RESULTS      *
* 7. TO MASTER MENU                      *
*                                        *

Which help do you want ?

Type the serial number of desired option

1

f-0     (initial-fact)
f-1     (start)
f-0     (initial-fact)
f-1     (start)
f-2     (mean 15157)
f-3     (skew -0.093395)
f-4     (kurt 2.66689992)
f-5     (stdev 4267.39990234)
f-6     (stdev of logtran data 0.31215999)
f-7     (length of data 50)

                WELCOME
                  TO
             F A C H V E S

This expert system is designed for single
peaked continuous hydrologic variables only

IF YOU DO NOT UNDERSTAND ANY QUESTION ; TYPE nk
nk -- Not known

Type of data : type the first letter (e/a/s)
Whether it is exceedence OR annual ? a
Do you think that outliers are present ? nk
NORMAL DISTRIBUTION CAN BE FIT
In absence of information outliers are
initially assumed not to be present
Use method of moments for parameter estimation

Here is the list of steps for analysis **
1.**DO THE PRELIMINARY ANALYSIS
2.**TEST FOR OUTLIERS
3.**FIT A SUITABLE DISTRIBUTION
4.**TEST FOR GOODNESS OF FIT
5.**PLOT THE RESULTS

1. You can fit either one OR all available distributions
** If more than one distribution satisfies the
   goodness of fit test (chi square test) then
   the best fit is the one with the least
   Chi square value

PROCESS IS BEING STOPPED;
RESET THE PROCESS IF YOU WANT TO CONTINUE
WITH OTHER SET OF DATA
Do you want any help ?

no

Suggestions passed should be taken in sequencial order

f-0     (initial-fact)
f-1     (start)
f-2     (mean 15157)
f-3     (skew -0.093395)
f-4     (kurt 2.66689992)
f-5     (stdev 4267.39990234)
f-6     (stdev of logtran data 0.31215999)
f-7     (length of data 50)
f-8     (type of data a)
f-9     (process continue yes)
f-10    (data is large enough)
f-11    (outliers are nk)
f-12    (suggestion 6)
f-13    (es ad ND)
f-14    (outliers no)
f-15    (stop)
f-16    (help no)

Excising rule: rule1
Excising rule: rule2
Excising rule: rule3
Excising rule: rule4
Excising rule: rule5
Excising rule: rule6
Excising rule: rule7
Excising rule: rule8
Excising rule: rule9
Excising rule: rule10
Excising rule: rule11
Excising rule: rule12
Excising rule: rule13
Excising rule: rule14
Excising rule: rule15
Excising rule: rule16
Excising rule: rule18
Excising rule: rule19
Excising rule: rule20
Excising rule: rule21
Excising rule: rule22
Excising rule: rule23
Excising rule: rule24
Excising rule: rule25
Excising rule: rule26


Now you are out of clips
Value passed = 6.000000
Name passed is =
ND

*----------------------------------------*
*              MASTER-MENU               *
*                                        *
* 1. PRELIMINARY ANALYSIS                *
* 2. CHOICE OF DISTRIBUTIONS             *
* 3. TESTS FOR GOODNESS OF FIT           *
* 4. PLOTTING OF RESULTS                 *
* 5. QUIT                                *
* 6. HELP FROM EXPERT SYSTEM INTERFACE   *
* 7. ANALYSIS FOR OTHER SEASONS          *
*                                        *

VALUE PASSED IS 6.000000
NAME PASSED IS
ND

Type the serial number of desired option

******************************************
*      $$$ PRELIMINARY ANALYSIS $$$      *
*----------------------------------------*
* 1. STATISTICAL PARAMETERS              *
* 2. FREQUENCY PLOT                      *
* 3. C.D.F. PLOT (FOR ALL DISTRIBUTIONS) *
* 4. CONFIDENCE BANDS                    *
* 5. TRANSFORMATIONS                     *
* 6. HELP FROM EXPERT SYSTEM INTERFACE   *
* 7. TO MASTER MENU                      *
* 8. OUTLIER TESTS                       *
*                                        *

Type the serial number of desired option
8

MEAN= 15156.8398
VAR= 0.18210E+08
SKEW= -0.93395E-01
STDEV= 0.42674E+04
KURTOSIS= 0.26669E+01   AVG. DEVIATION= 0.34710E+04

THE DATA IS BEING CHECKED FOR OUTLIERS

FOR WHICH DISTRIBUTION DO YOU WANT IT CHECKED i.e.
WHETHER 1. NORMAL, 2. LOG-NORMAL (2 PARAMETERS)
3. LOG-NORMAL (3 PARAMETERS), 4. EV-1
5. EV-III, 6. PT-III, 7. LPT-III

THE PARAMETERS FOR THE ORIGINAL DATA ARE :
MEAN= 15156.8398
SKEW= -0.93395E-01
STDEV= 0.42674E+04
*** FOR ORIGINAL DATA ***
CHECK FOR OUTLIERS BY WRC METHOD

THE PARAMETERS FOR THE LOG-TRANSFORMED DATA ARE :
MEAN= 0.0000
SKEW= 0.00000E+00
STDEV= 0.00000E+00

XMIN= 0.63620E+04   XMAX= 0.24313E+05

FOR NORMAL DISTRIBUTION

THE THRESHOLD VALUE FOR HIGHER OUTLIERS : 26968.90
THE THRESHOLD VALUE FOR LOWER OUTLIERS : 3344.770

XMIN= 0.63620E+04   XMAX= 0.24313E+05
NO. OF HIGH OUTLIERS = 0
NO. OF LOW OUTLIERS = 0

OUTLIERS ARE ABSENT FOR THIS DISTRIBUTION
******************************************
*      $$$ PRELIMINARY ANALYSIS $$$      *
*                                        *
* 1. STATISTICAL PARAMETERS              *
* 2. FREQUENCY PLOT                      *
* 3. C.D.F. PLOT (FOR ALL DISTRIBUTIONS) *
* 4. CONFIDENCE BANDS                    *
* 5. TRANSFORMATIONS                     *
* 6. HELP FROM EXPERT SYSTEM INTERFACE   *
* 7. TO MASTER MENU                      *
* 8. OUTLIER TESTS                       *
*                                        *

Type the serial number of desired option
7

*----------------------------------------*
*              MASTER-MENU               *
*                                        *
* 1. PRELIMINARY ANALYSIS                *
* 2. CHOICE OF DISTRIBUTIONS             *
* 3. TESTS FOR GOODNESS OF FIT           *
* 4. PLOTTING OF RESULTS                 *
* 5. QUIT                                *
* 6. HELP FROM EXPERT SYSTEM INTERFACE   *
* 7. ANALYSIS FOR OTHER SEASONS          *
*                                        *

VALUE PASSED IS 6.000000
NAME PASSED IS
ND

Type the serial number of desired option
2

******************************************
*        %% METHOD OF ANALYSIS %%        *
*                                        *
* 1. METHOD OF MOMENTS                   *
* 2. METHOD OF MAXIMUM LIKELYHOOD        *
* 3. METHOD OF LEAST-SQUARES             *
* 4. TO MASTER MENU                      *
*                                        *

Which method do you like ?

Type the serial number of desired option
1

******************************************
*      %% CHOICE OF DISTRIBUTION %%      *
*                                        *
* 1. NORMAL                              *
* 2. LOG-NORMAL (2-PARAMETER)            *
* 3. LOG-NORMAL (3-PARAMETER)            *
* 4. PEARSON TYPE 3                      *
* 5. LOG-PEARSON TYPE-3                  *
* 6. EXTREME-VALUE TYPE-1                *
* 7. EXTREME-VALUE TYPE-3                *
* 8. FAP -- PROGRAMS                     *
* 9. TO MASTER MENU                      *
* 10. TRANSFORMATIONS                    *
* 11. METHOD OF ANALYSIS                 *
*                                        *

If you want to transform the data **TYPE-10
If not, then which distribution ? type the serial number

1

******************************************
*     %% TEST OF GOODNESS OF FIT %%      *
*                                        *
* 1. CHI-SQUARE TEST                     *
* 2. KOLMOGOROV-SMIRNOV TEST             *
* 3. TO PLOTTING RESULTS                 *
* 4. COMPARISION OF STANDARD ERROR       *
*    OF ESTIMATE FOR DIFFERENT           *
*    DISTRIBUTIONS                       *
* 5. TO MASTER MENU                      *
* 6. HELP FROM EXPERT-SYSTEM INTERFACE   *
*                                        *

Type the serial number of desired option
1

CHI-SQUARE STATISTIC
********************

1.162785

NO. OF DEGREES OF FREEDOM
*************************

3

THE THEORITICAL CHI-SQUARE VALUE IS : 7.815000

FOR 3 DEGREES OF FREEDOM

THE FIT IS GOOD

******************************************
*         %% PLOTTING RESULTS %%         *
*                                        *
* 1. FREQUENCY PLOT                      *
* 2. C.D.F. PLOT                         *
* 3. CONFIDENCE BANDS                    *
* 4. TO MASTER MENU                      *
*                                        *

Type the serial number of desired option

NORMAL AND LOG-NORMAL DISTRIBUTIONS
***********************************
MEAN= 15156.8398
VAR= 0.18210E+08
SKEW= -0.93395E-01
STDEV= 0.42674E+04
KURTOSIS= 0.26669E+01   AVG. DEVIATION= 0.34710E+04

MEAN= 15156.8398
VAR= 0.18210E+08
SKEW= -0.93395E-01
STDEV= 0.42674E+04
KURTOSIS= 0.26669E+01   AVG. DEVIATION= 0.34710E+04

[Line-printer plot follows; not recoverable from the scan]
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FIG. 5 . 4A HISTOGRAM FOR SEASONAL DATA FOR THE MONTH 


OF AUGUST FOR MAHANADI BASIN 
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TABLE 5.6 


PARAMETERS FOR NORMAL DISTRIBUTION FOR SEASONAL DATA IN MAHANADI BASIN

*********************************************
NORMAL DISTRIBUTION
*********************************************
MEAN               = 15156.839844
STANDARD DEVIATION = 4267.363281
*********************************************

T YEARS      T YEAR EVENTS
    2.0      15156.839844
    5.0      18741.425781
   10.0      20619.064453
   20.0      22197.990234
   50.0      23947.609375
  100.0      25099.796875
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Table 5 7 


Analysis for Seasonal data for Mahanadi Basin

Theoretical Chi square value χ² (95%, 3 DOF) = 7.815

(? marks an entry that is illegible in the scan; blank entries are
blank in the original)

Season  Stat. params   Dist.      Outliers   Outliers    Method    Dist.  χ²
no.     kurt   skew    suggested  if any     considered  of        fit    value
                                  high low   High Low    analysis
------  -----  ------  ---------  ---- ---   ---- ---    --------  -----  -----
1       8.37   1.99    PT3        1    0     0    0      MM        PT3    3.36
                       LN3        0    0
                       LPT3       1    0
2       7.61   1.45    PT3        1    0     1    0      MLS       PT3    1.87
                       LN3        0    0     0    0      MM        LN3    2.38
                       LPT3       1    0     1    0      MLS       LPT3   1.37
3       2.66   -0.09   N          0    0     0    0      MM        N      1.16
                       PT3        1    0     1    0      MLS       PT3    1.37
4       5.30   0.892   PT3        1    0     1    0      MM        PT3    3.36
                       LN3        0    ?
                       LPT3       1    0                 MLS
5       5.24   1.14    LN2        0    0     ?    ?
                       PT3        2    0     0    0      MM        PT3    0.92
                       LN3        0    0
                       LPT3       ?    0
6       6.31   0.628   PT3        ?    0     2    0      MLS       PT3    2.57
                       LN3        0    ?
                       LPT3       0    1

From considerations of consistency, PT3 is a good fit for all
months and the nonmonsoon season.
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5. 3 Conclusions 

The test runs of the program FACHVES Ver 2.0 with the
above sets of data show that the KBES is very helpful in fitting a
probability distribution to a given set of data. The distribution
with the least Chi square value is taken as the best fit for the
data set when only one nonseasonal data set is considered. In the
case of seasonal data, consistency of distributions over the
seasons may also be considered. This is not implemented in the
program.


CHAPTER 6 


SUMMARY, CONCLUSIONS AND SUGGESTIONS FOR FURTHER STUDY

6.1 Summary

Frequency analysis of hydrologic data is a knowledge
domain where a lot of decisions have to be taken based on human
expertise and intuition. Frequency analysis, being data dependent,
is subject to many errors. Moreover, there is no fixed
distribution that can be fitted to a given set of data. An earlier
study dealt with the structure and development of an ES based
computer program, FACHVES. The Expert system is embedded in the
FORTRAN program. The program is written in the FORTRAN language and
the expert system has been developed for the VAX-VMS environment.
This study deals with modification and enhancement of the
capabilities of the earlier version.

In the present version, a number of subroutines have
been added which help in a more detailed analysis of the data
set. For example, the subroutine OUTLIER identifies the number and
the values of the outliers present in the data; the subroutine
CONFBND is a plotting subroutine which plots the fitted
distribution, the sample data and the confidence bands. Other
subroutines include FREQPL and TPLOT for plotting, MLS for the
method of least squares and GOFITST for comparing theoretical and
actual Chi square values in the goodness of fit test. In the given
program

i. a distribution can be fitted to the data after taking Expert
system advice,

ii. the given set of data can be tested for outliers,

iii. if outliers are present, parameters can be re-estimated by
MLS,

iv. the goodness of fit can be tested by the Chi square test,

v. the fitted distributions can be visualised graphically by
seeing the plots, and

vi. which distribution is more appropriate to the given set of
data may then be decided.

Hence, the modifications made have enhanced the capabilities in
fitting a distribution.

6.2 Conclusions

Frequency analysis of hydrologic data is a knowledge
domain where a lot of decisions are to be taken based on human
intuition, experience and subjective judgement.

The developed package of FACHVES Ver 2.0 has been tested
with three sets of data representing recorded streamflow data at
two different sites in India. The results of this study verify the
satisfactory performance of the package, and in particular the
enhancements of outlier test and graphic output.

6.3 Suggestions for further study

A large amount of expert knowledge is required
and available for frequency analysis of hydrologic data. The
package can be improved for better practical use. Some of the
modifications that can still be done are:

1. Include more tests for testing outliers, e.g., block
procedures and robust estimates can be used.

2. A linear Golden search method can be introduced to calculate
the optimal value of the coefficient of skew, which is used to
calculate the parameters of the Pearson Type 3 or Log Pearson Type 3
distribution by the MLS method.

3. A rule can be introduced which fits all the suggested
distributions and finally suggests a single distribution suitable
for all the seasons in the seasonal analysis of data.

4. Further enhancement of the knowledge base of the ES.

5. Enhancement of graphical capabilities, like making use of a
microVAX-VMS graphics terminal.

6. Implementation of the package in other computing environments.

7. It can be extended to the fitting of Time Series.
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APPEWplX A 


4*Dec- 1991 10 11 46 VAX FORTRAN V 4 b 2M 

25-Nov- 1991 14 21 14 t D 1SV 3 CBADDU CLIPS JUNKIFORCL IP P0» , 434 


C     ***********************************
      SUBROUTINE MLS(IOC,IOUTHI,IOUTLO,RXX,ILPT,Z,NC)
      COMMON/BLOCK1/NS,NPT,IALT
      COMMON/BLOCK2/RX(600),NX,HEADING(80)
      COMMON/GO/CSNS
      COMMON/PIG/GAMMA
      REAL KUT,KURT,M1,M2,M3
      DIMENSION X(60),XX(60),P(60),W(60),Y(60),Z(60),XL(60)
      DIMENSION TK(60),TPN1(60),TKLPN1(60)
      DIMENSION T(60),FX(60),XLP(60),XPL(60),SDP(60),SD(60)
      DIMENSION Z1(60)
      DIMENSION TKLP(60),PLN(60),J(20),X11(60),RESULT(20)
      DIMENSION IMAGE1(9000)
      DIMENSION SDP1(60),TX(60),EY1(60),EY3(60),Z2(60)
      DIMENSION X1(60),XL1(60),TKE1(60),PML(60),RXX(60),TX1(60)
      N=NX/NS
      LM=(NC-1)*N
      DO IJK=LM,LM+N
        X(IJK-LM)=RX(IJK)
      END DO
      NXX=N-1
      DO 6 K=1,NXX
      DO 6 I=K,NXX
      IF(X(K).GT.X(I+1)) GO TO 6
      TEMP=X(K)
      X(K)=X(I+1)
      X(I+1)=TEMP
    6 CONTINUE
      GOTO (115,115,115,125,125,120,120) IOC
  115 WRITE(*,*)
      WRITE(*,*)' NORMAL AND LOG-NORMAL DISTRIBUTIONS'
      WRITE(*,*)' ***********************************'
   99 AN1=FLOAT(N)+0.25
  750 DO 9 I=1,N
      IF(ILPT.EQ.1)THEN
        P(I)=FLOAT(I)/FLOAT((N+1))
      ELSE
        P(I)=((FLOAT(I)-(3.0/8.0))/AN1)
      END IF
      IF(P(I).GT.0.5) GO TO 7
      W(I)=SQRT(ALOG(1./(P(I)*P(I))))
      F=1
      GO TO 8
    7 F=-1
      W(I)=SQRT(ALOG(1./((1-P(I))*(1-P(I)))))
    8 Y(I)=W(I)-(2.515517+0.802853*W(I)+0.010328*W(I)*W(I))/
     1(1+1.432788*W(I)+0.189269*W(I)*W(I)+0.001308*W(I)*W(I)*W(I))
      IF(ILPT.EQ.1)THEN
        Z2(I)=F*Y(I)
        GOTO 9
      END IF
      Z(I)=F*Y(I)
    9 CONTINUE
      IF(ILPT.EQ.1)GOTO 760
      IF(IOUT.EQ.1) GOTO 929
      IF(ILPT.EQ.2) RETURN
      IF(IOC.EQ.3)THEN


        CALL LN3(1,M1,M2,M3,0,0)
        WRITE(*,*)' THE SHIFT PARAMETER IN LN3 IS ',M3
        DO I=1,N
          X(I)=X(I)-M3
        ENDDO
      END IF
      DO 4 I=1,N
      X1(I)=X(I)
      XL1(I)=ALOG(X(I))
    4 CONTINUE
      WRITE(*,10)
      WRITE(*,11)(I,X1(I),XL1(I),P(I),W(I),Z(I),I=IOUTHI+1,N-IOUTLO)
   10 FORMAT(3X,'I',9X,'X(I)',9X,'LX(I)',9X,'P(I)',9X,'W(I)',9X,
     1'Z(I)'/)
   11 FORMAT(1X,I3,5X,F8.2,3X,F8.2,5X,F8.2,5X,F8.2,5X,F8.2)
      NN=N-(IOUTHI+IOUTLO)
      DO I=IOUTHI+1,N-IOUTLO
        X1(I-IOUTHI)=X1(I)
        XL1(I-IOUTHI)=XL1(I)
        Z1(I-IOUTHI)=Z(I)
      ENDDO
      IF(IOC.EQ.1)THEN
        CALL MLSREG(NN,Z1,X1,RESULT,IOC,1)
        DO I=1,N
          RXX(I)=RESULT(2)+Z(I)*RESULT(4)
        ENDDO
      ELSE
        CALL MLSREG(NN,Z1,XL1,RESULT,IOC,2)
        DO I=1,N
          RXX(I)=RESULT(2)+Z(I)*RESULT(4)
        ENDDO
      END IF
      IF(ILPT.EQ.3) RETURN
  760 DO I=1,N
        Z(I)=Z2(I)
      ENDDO
      RETURN
C
  120 WRITE(*,*)
      WRITE(*,*)' EXTREME-VALUE DISTRIBUTION'
      WRITE(*,*)' ***************************'
  770 DO 12 I=1,N
      AN1=FLOAT(N)+0.12
      IF(ILPT.EQ.1)THEN
        P(I)=FLOAT(I)/FLOAT(N+1)
      ELSE
        PML(I)=((FLOAT(I)-0.44)/AN1)
        P(I)=PML(I)
      END IF
      T(I)=1.0/P(I)
      T1=ALOG(ALOG(T(I)/(T(I)-1.0)))
      IF(ILPT.EQ.1)THEN
        TKE1(I)=-(0.5772+T1)*SQRT(6.0)/3.141562
      ELSE
        TK(I)=-(0.5772+T1)*SQRT(6.0)/3.141562
      END IF
      X1(I)=X(I)
      XL1(I)=ALOG(X(I))

      IF(ILPT.EQ.1) XL1(I)=ALOG(X(I))
   12 CONTINUE
      IF(ILPT.EQ.1)GOTO 780
      IF(ILPT.EQ.2)THEN
        DO I=1,N
          Z(I)=TK(I)
        ENDDO
        RETURN
      END IF
      WRITE(*,13)
      WRITE(*,14)(I,X1(I),XL1(I),P(I),T(I),TK(I),I=IOUTHI+1,N-IOUTLO)
   13 FORMAT(3X,'I',9X,'X(I)',9X,'LX(I)',9X,'P(I)',9X,'T(I)',9X,
     1'KT(I)'/)
   14 FORMAT(1X,I3,5X,F8.2,5X,F8.2,5X,F8.2,5X,F8.2,5X,F8.2)
      NN=N-(IOUTHI+IOUTLO)
      DO I=IOUTHI+1,N-IOUTLO
        X1(I-IOUTHI)=X1(I)
        XL1(I-IOUTHI)=XL1(I)
        PML(I-IOUTHI)=PML(I)
      ENDDO
      IF(IOC.EQ.6)THEN
        CALL PARAM(X1,NN,AME,VAR,SKW,AKT,STD,2)
        E1SUM=0.0
        B1=STD*SQRT(6.0)/3.1415
        A1=AME-0.5772*B1
        DO I=1,NN
          EY1(I)=0.0
          EY1(I)=EY1(I)+EXP(-(X1(I)/B1))
          E1SUM=E1SUM+EY1(I)
        ENDDO
        NIL=0
   62   NXX=NN-1
        DO 61 K=1,NXX
        DO 61 I=K,NXX
        IF(EY1(K).GT.EY1(I+1)) GO TO 61
        TEMP=EY1(K)
        EY1(K)=EY1(I+1)
        EY1(I+1)=TEMP
   61   CONTINUE
        IF(NIL.EQ.1)GOTO 63
        P1=0.0
        DO I=1,NN
          P(I)=ALOG(PML(I))
          P1=P1+P(I)
        ENDDO
        ALAM=E1SUM/P1
        WRITE(*,*)' FOR THE EXPONENTIAL DISTRIBUTION '
        WRITE(*,*)'THE METHOD OF LEAST SQUARE ERRORS CONSTANT = ',ALAM
        DO I=1,N
          EY1(I)=P(I)*ALAM
          EY1(I)=ALOG(EY1(I))*(-B1)
        ENDDO
        NIL=1
        GOTO 62
   63   DO I=1,N
          RXX(I)=EY1(I)
        ENDDO
        CALL PARAM(EY1,N,AMEAN,VAR,SKEW,KURT,STDEV,2)
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MLS 



        CALL CHISQR(8,AMEAN,STDEV,EY1,N,EY1,NS,NPT,IALT,IQC,1)
        WRITE(*,*)' THE CHI-SQUARE VALUE OBTAINED USING MLS DUE TO'
        WRITE(*,*)' THE PRESENCE OF OUTLIERS IS
        CONTINUE
      ELSE
        CALL PARAM(XL1,NN,AMEAN,VARI,SKEW,KURT,STDEV,2)
        E3SUM=0.0
        B1=STDEV*SQRT(6.0)/3.1415
        A1=AMEAN-0.5772*B1
        DO I=1,NN
          EY3(I)=0.0
          EY3(I)=EY3(I)+EXP(-(XL1(I)/B1))
          E3SUM=E3SUM+EY3(I)
        ENDDO
        NIL1=0
   64   NXX=NN-1
        DO 60 K=1,NXX
        DO 60 I=K,NXX
          IF(EY3(K).GT.EY3(I+1)) GO TO 60
          TEMP=EY3(K)
          EY3(K)=EY3(I+1)
          EY3(I+1)=TEMP
   60   CONTINUE
        IF(NIL1.EQ.1)GOTO 65
        P1=0.0
        DO I=1,NN
          P(I)=ALOG(PHL(I))
          P1=P1+P(I)
        ENDDO
        ALAM=E3SUM/P1
        WRITE(*,*)' FOR THE EXPONENTIAL DISTRIBUTION '
        WRITE(*,*)' THE METHOD OF LEAST SQUARE ERRORS CONSTANT = ',ALAM
        DO I=1,N
          EY3(I)=P(I)*ALAM
          EY3(I)=ALOG(ABS(EY3(I)))*(-B1)
        ENDDO
        NIL1=1
        GOTO 64
   65   DO I=1,N
          RXX(I)=EY3(I)
        ENDDO
        CALL PARAM(EY3,N,AMEAN,VAR,SKEW,KURT,STDEV,2)
        CALL CHISQR(8,AMEAN,STDEV,EY3,N,EY3,NS,NPT,IALT,IQC,1)
        WRITE(*,*)' THE CHI-SQUARE VALUE OBTAINED USING MLS DUE TO'
        WRITE(*,*)' THE PRESENCE OF OUTLIERS IS
        CONTINUE
      ENDIF
      IF(ILPT.EQ.3)RETURN
  700 DO I=1,N
        Z(I)=TKE1(I)
      ENDDO
      RETURN


C
  125 WRITE(*,*)
      WRITE(*,*)' PEARSON AND LOG-PEARSON DISTRIBUTION'
      WRITE(*,*)' ************************************'
      DO I=1,N



        X1(I)=X(I)
      ENDDO
  126 DO I=1,N
        XPL(I)=ALOG(X(I))
        XL(I)=ALOG10(X(I))
      ENDDO
      CALL PARAM(X,N,PMEAN,PVAR,CSX,PKURT,SDX,4)
      CALL PARAM(XL,N,PLMEAN,PLVAR,CSPL,PLKURT,SDPL,4)
      DO 19 I=1,N
        AN1=FLOAT(N)+0.25
        P(I)=((FLOAT(I)-(3.0/8.0))/AN1)
   19 CONTINUE
      PLK=CSPL/6.0
      PK=CSX/6.0
      IOU1=1
      GOTO 99
  929 DO 21 I=1,N
        TK1=Z(I)+(Z(I)**2-1.0)*PLK+1.0/3.0*(Z(I)**3-6.0*Z(I))*PLK**2
        TK2=-(Z(I)**2-1.0)*PLK**3+Z(I)*PLK**4+1.0/3.0*PLK**5
        TK3=Z(I)+(Z(I)**2-1.0)*PK+1.0/3.0*(Z(I)**3-6.0*Z(I))*PK**2
        TK4=-(Z(I)**2-1.0)*PK**3+Z(I)*PK**4+1.0/3.0*PK**5
        TKLP(I)=TK1+TK2
        TK(I)=TK3+TK4
   21 CONTINUE
      IOU1=2
      IF(ILPT.EQ.2)THEN
        IF(IQC.EQ.4)THEN
          DO I=1,N
            Z(I)=TK(I)
          ENDDO
          RETURN
        ELSE
          DO I=1,N
            Z(I)=TKLP(I)
          END DO
          RETURN
        ENDIF
      ENDIF
      WRITE(*,22)
      WRITE(*,23)(I,X1(I),XL(I),P(I),TK(I),TKLP(I),I=IOUTHI+1,N-IOUTLO)
   22 FORMAT(3X,'I',9X,'X(I)',9X,'LX(I)',9X,'P(I)',7X,'KTP(I)',
     17X,'KTLP(I)'/)
   23 FORMAT(1X,I3,5X,F8.2,5X,F8.2,5X,F8.2,5X,F8.2,5X,F8.2)
      NN=N-(IOUTHI+IOUTLO)
      DO I=IOUTHI+1,N-IOUTLO
        X1(I-IOUTHI)=X1(I)
        XL1(I-IOUTHI)=XL(I)
        TKN1(I-IOUTHI)=TK(I)
        TKLPN1(I-IOUTHI)=TKLP(I)
      END DO
      IF(IQC.EQ.4)THEN
        CALL NLSREG(NN,TKN1,X1,RESULT,IQC,1)
        DO I=1,N
          RXX(I)=RESULT(2)+TK(I)*RESULT(4)
        ENDDO
      ELSE
        CALL NLSREG(NN,TKLPN1,XL1,RESULT,IQC,2)
        DO I=1,N
          RXX(I)=RESULT(2)+TKLP(I)*RESULT(4)
        ENDDO
      ENDIF
      IF(CSNS.EQ.1)GOTO 876
      IF(IQC.EQ.4)THEN
        WRITE(*,*)' PEARSON TYPE 3 DISTRIBUTION '
        WRITE(*,*)' *************************** '
        BETA=(2.0/CSX)**2
        FSK=CSX
      ELSE
        WRITE(*,*)' LOG-PEARSON TYPE 3 DISTRIBUTION '
        WRITE(*,*)' ******************************* '
        BETA=(2.0/CSPL)**2
        FSK=CSPL
      ENDIF
      ALPHA=RESULT(4)/SQRT(BETA)
      GAMMA=RESULT(2)-RESULT(4)*SQRT(BETA)
      WRITE(*,*)
      WRITE(*,*)' METHOD OF LEAST SQUARES '
      WRITE(*,*)' '
      WRITE(*,*)
      WRITE(*,*)' ----- '
      WRITE(*,*)
      WRITE(*,*)' ALPHA = ',ALPHA,' MEAN = ',RESULT(2)
      WRITE(*,*)' BETA = ',BETA,' STDEV = ',RESULT(4)
      WRITE(*,*)' GAMMA = ',GAMMA,' SKEW = ',FSK
      WRITE(*,*)
      WRITE(*,*)' '
  876 IF(ILPT.EQ.3)RETURN
      END
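The frequency factors computed at statement 929 in MLS (TK1+TK2 and TK3+TK4) have the form of the six-term polynomial in k = Cs/6 commonly used to approximate Pearson type III frequency factors. The following is a minimal free-form sketch of that polynomial only, not part of the thesis program. The names kt_demo and kt_pearson3 and the sample values of z and cs are assumptions made for illustration.

```fortran
program kt_demo
  implicit none
  real :: z, cs
  z = 2.326    ! standard normal variate, about the 1% exceedance level (assumed)
  cs = 0.6     ! illustrative skew coefficient, not taken from the thesis data
  print '(a,f8.4)', ' KT = ', kt_pearson3(z, cs)
contains
  ! Frequency factor from the standard normal variate z and the skew cs,
  ! using k = cs/6 as the listing does with PK and PLK.
  real function kt_pearson3(z, cs)
    real, intent(in) :: z, cs
    real :: k
    k = cs / 6.0
    kt_pearson3 = z + (z**2 - 1.0)*k + (z**3 - 6.0*z)*k**2/3.0 &
                - (z**2 - 1.0)*k**3 + z*k**4 + k**5/3.0
  end function kt_pearson3
end program kt_demo
```

With cs = 0 every term in k vanishes and the function returns z itself, which is a quick sanity check on the polynomial.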
C     ***************************************************
      SUBROUTINE FREQPL(N,X)
C     THIS ROUTINE DRAWS A HISTOGRAM FOR THE GIVEN DATA
C     ***************************************************
      DIMENSION X(50),X1(50),J(10)
      CHARACTER STRA(10)
      INCLUDE '($SMGDEF)'
      INTEGER SMG$CREATE_VIRTUAL_DISPLAY,SMG$CREATE_PASTEBOARD
      INTEGER SMG$DRAW_LINE,SMG$PUT_CHARS
      INTEGER SMG$PASTE_VIRTUAL_DISPLAY,SMG$DRAW_RECTANGLE
      INTEGER DISPLAY1,PASTE1,ROWS,COLUMNS,STATUS
      STATUS=SMG$DELETE_PASTEBOARD(PASTE1)
      NX=N-1
      DO 6 K=1,NX
      DO 6 I=K,NX
        IF(X(K).GT.X(I+1)) GO TO 6
        TEMP=X(K)
        X(K)=X(I+1)
        X(I+1)=TEMP
    6 CONTINUE
      DO 1 I=1,N
        X1(I)=X(I)/1000.0
    1 CONTINUE
C
      DX1=(X(1)-X(N))/700.0
      DX=(FLOAT(INT(DX1)))/10.0
      Y1=FLOAT(INT(X1(N)))
      IF(INT(Y1/10.0).EQ.0)THEN
        IY1=INT(Y1*10.0)
        IDX=INT(20*DX)
      ELSE
        IY1=INT(Y1)
        IDX=INT(2*DX)
      ENDIF
C
      LC=0
      DO 15 I=1,10
        Y2=Y1+DX
        J(I)=0
        DO 20 K=1,N
          IF(X1(K).GT.Y1.AND.X1(K).LT.Y2)J(I)=J(I)+1
   20   CONTINUE
        LC=LC+1
        IF(Y2.GT.X1(1))GOTO 2
        Y1=Y2
   15 CONTINUE
C
C
    2 ROWS=25
      COLUMNS=65
      STATUS=SMG$CREATE_VIRTUAL_DISPLAY
     1 (ROWS,COLUMNS,DISPLAY1,SMG$M_BORDER)
      IF(.NOT.STATUS) CALL LIB$SIGNAL(%VAL(STATUS))
C
      STATUS=SMG$CREATE_PASTEBOARD(PASTE1)
      IF(.NOT.STATUS) CALL LIB$SIGNAL(%VAL(STATUS))
C



      STATUS=SMG$DRAW_LINE(DISPLAY1,18,7,18,60)
      STATUS=SMG$DRAW_LINE(DISPLAY1,1,7,18,7)
      LC1=7
      LC2=12
      DO 30 I=1,LC
        LR1=18-J(I)
        IF(J(I).GT.18)LR1=1
        IF(LR1.EQ.18)THEN
          STATUS=SMG$DRAW_LINE(DISPLAY1,18,LC1,18,LC2)
          GOTO 31
        ENDIF
        STATUS=SMG$DRAW_RECTANGLE(DISPLAY1,LR1,LC1,18,LC2)
   31   LC1=LC2
        LC2=LC2+5
   30 CONTINUE
      IF(.NOT.STATUS) CALL LIB$SIGNAL(%VAL(STATUS))
C
      STATUS=SMG$PUT_CHARS(DISPLAY1,
     1 'HISTOGRAM FOR THE GIVEN DATA',21,20)
C
      DO 61 K=1,8
      DO 61 I=K,8
        IF(J(K).GT.J(I+1)) GO TO 61
        TEMP=J(K)
        J(K)=J(I+1)
        J(I+1)=TEMP
   61 CONTINUE
C
      DO 40 I=1,J(1)+2
        KR=18-I
        K=I
        STATUS=SMG$DRAW_LINE(DISPLAY1,KR,6,KR,7)
   40 CONTINUE
C
      DO 130 I=1,J(1)+2
        IF(I/10.EQ.0)THEN
          STRA(1)=CHAR(32)
        ELSE
          STRA(1)=CHAR(I/10+48)
        ENDIF
        STRA(2)=CHAR(I-I/10*10+48)
        STATUS=SMG$PUT_CHARS(DISPLAY1,STRA,18-I,4)
  130 CONTINUE
C
      DO 131 I=1,7
        STRA(2)=CHAR(IY1-IY1/10*10+48)
        STRA(1)=CHAR(IY1/10+48)
        STATUS=SMG$PUT_CHARS(DISPLAY1,STRA,19,(I-1)*10+7)
        IY1=IY1+IDX
  131 CONTINUE
C
      STATUS=SMG$PUT_CHARS(DISPLAY1,'F',9,1)
      STATUS=SMG$PUT_CHARS(DISPLAY1,'R',10,1)
      STATUS=SMG$PUT_CHARS(DISPLAY1,'E',11,1)
      STATUS=SMG$PUT_CHARS(DISPLAY1,'Q',12,1)
      STATUS=SMG$PUT_CHARS(DISPLAY1,'U',13,1)
      STATUS=SMG$PUT_CHARS(DISPLAY1,'E',14,1)



      STATUS=SMG$PUT_CHARS(DISPLAY1,'N',15,1)
      STATUS=SMG$PUT_CHARS(DISPLAY1,'C',16,1)
      STATUS=SMG$PUT_CHARS(DISPLAY1,'Y',17,1)
C
      STATUS=SMG$PASTE_VIRTUAL_DISPLAY(DISPLAY1,PASTE1,4,15)
      IF(.NOT.STATUS) CALL LIB$SIGNAL(%VAL(STATUS))
      WRITE(*,100)
  100 FORMAT(1H1)
      RETURN
      END
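FREQPL counts how many observations fall in each class interval before drawing one bar per class with the VMS screen-management routines. The class-counting step on its own can be sketched in portable free-form Fortran as below. This is an illustration under stated assumptions, not the thesis code: the program name hist_demo, the sample data, the choice of four equal-width classes and the text-only output are all hypothetical, and the sketch assigns every value to a class, whereas the listing uses strict inequalities at both class limits.

```fortran
program hist_demo
  implicit none
  integer, parameter :: n = 8, nbins = 4
  real :: x(n), lo, width
  integer :: counts(nbins), i, k
  ! made-up sample values
  x = (/ 1.2, 3.4, 2.2, 4.9, 0.5, 3.9, 2.8, 1.7 /)
  lo = minval(x)
  width = (maxval(x) - lo) / real(nbins)
  counts = 0
  do i = 1, n
    ! class index of x(i); the maximum is folded into the last class
    k = min(int((x(i) - lo) / width) + 1, nbins)
    counts(k) = counts(k) + 1
  end do
  do k = 1, nbins
    ! one row of asterisks per class, as long as the class count
    print '(f6.2,a,f6.2,2x,a)', lo + (k - 1)*width, ' -', &
          lo + k*width, repeat('*', counts(k))
  end do
end program hist_demo
```

Printing rows of asterisks stands in for the SMG$DRAW_RECTANGLE calls, so the sketch runs on any system with a Fortran 90 compiler.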






C     ***************************
C     CHECKS FOR OUTLIERS IN ANY DISTRIBUTION
      SUBROUTINE OUTLIER(IOUTHI,IOUTLO,NC)
      COMMON/BLOCK1/NS,NPT,IALT
      COMMON/BLOCK2/RX(600),NX,HEADING(90)
      REAL KURT,KO,KUT,M1,M2,M3
      DIMENSION X(60),TX(60),FILE(60),KO(101),EY1(60),EY3(60)
      DIMENSION TEST(100),SPH(59),SPL(18)
      DATA KO/2.036,2.088,2.134,2.175,2.213,2.247,2.279,2.309,2.335
     1,2.361,2.385,2.408,2.429,2.448,2.467,2.486,2.502,2.519,2.534
     2,2.549,2.563,2.577,2.591,2.604,2.616,2.628,2.639,2.650,2.661
     3,2.671,2.682,2.692,2.7,2.71,2.719,2.727,2.736,2.744,2.753,2.760
     4,2.768,2.775,2.782,2.789,2.796,2.804,2.81,2.817,2.8230,2.83
     5,2.837,2.842,2.848,2.854,2.86,2.866,2.871,2.876,2.882,2.887
     6,2.893,2.898,2.9,2.907,2.912,2.917,2.921,2.926,2.93,2.935
     7,2.94,2.944,2.948,2.952,2.956,2.957,2.96,2.965,2.969,2.973
     8,2.977,2.981,2.985,2.989,2.99,2.996,3.0,3.0,3.0,3.01,3.013
     9,3.017,3.02,3.023,3.026,3.029,3.033,3.036,3.039,3.042,3.045/
      DATA SPH/0.9750,0.8709,0.7679,0.6838,0.6161,0.5612,0.5157
     1,0.4775,0.4450,0.4187,0.3924,0.37313,0.3538,0.3346,0.32178
     2,0.30896,0.29614,0.28332,0.2705,0.26173,0.25295,0.24417,0.2354
     3,0.22916,0.22293,0.2167,0.21046,0.20423,0.1980,0.19396,0.18992
     4,0.18588,0.18184,0.1778,0.17376,0.16972,0.16568,0.16164,0.1576
     5,0.15537,0.15315,0.15093,0.1487,0.14647,0.14425,0.14203,0.1398
     6,0.13757,0.13535,0.13313,0.1309,0.12867,0.12645,0.12423,0.1220
     7,0.11977,0.11755,0.11533,0.1131/
      DATA SPL/0.00844,0.00424,0.00255,0.00170,0.00122,0.000913
     1,0.00071,0.000566,0.000478,0.000388,0.0003345,0.000281
     2,0.000247,0.000213,0.000190,0.000167,0.000151,0.000135/
      N=NX/NS
      LK=(NC-1)*N
      DO IJK=LK+1,LK+N
        X(IJK-LK)=RX(IJK)
      END DO
      NXX=N-1
      DO 6 K=1,NXX
      DO 6 I=K,NXX
        IF(X(K).GT.X(I+1)) GO TO 6
        TEMP=X(K)
        X(K)=X(I+1)
        X(I+1)=TEMP
    6 CONTINUE
      SS=0.0
      DO I=1,N
        SS=SS+X(I)
      END DO
      CALL PARAM(X,N,AME,VARI,SKW,KUT,STD,2)
      REWIND 20
      WRITE(*,*)
      WRITE(*,*)' THE DATA IS BEING CHECKED FOR OUTLIERS'
      WRITE(*,*)' FOR WHICH DISTRIBUTION DO YOU WANT IT CHECKED i.e.'
      WRITE(*,*)' WHETHER 1.NORMAL, 2.LOG-NORMAL(2 PARAMETERS)'
      WRITE(*,*)' 3.LOG-NORMAL(3 PARAMETERS), 4.EV-I'
      WRITE(*,*)' 5.EV-III, 6.PT-III, 7.LPT-III'
      READ(*,*)IOU
      GOTO(110,15,15,115,15,130,15)IOU
   15 IF(IOU.EQ.3)THEN



        CALL LN3(X,M1,M2,M3,0,0)
        X(I)=X(I)-M3
      ENDIF
      DO 212 I=1,N
        TX(I)=X(I)
        IF(TX(I).LE.0.0)TX(I)=0.1
  212 X(I)=ALOG(TX(I))
      SS=0.0
      DO I=1,N
        SS=SS+X(I)
      END DO
      CALL PARAM(X,N,AMEAN,VAR,SKEW,KURT,STDEV,2)
      IF(IOU.EQ.2.OR.IOU.EQ.3)GOTO 142
      IF(IOU.EQ.5)GOTO 120
      IF(IOU.EQ.7)GOTO 131
  110 WRITE(*,*)
      WRITE(*,*)' THE PARAMETERS FOR THE ORIGINAL DATA ARE '
      WRITE(*,192)AME,SKW,STD
  192 FORMAT(2X,'MEAN=',2X,F10.4,/,2X,'SKEW=',2X,E12.5,/,
     1 2X,'STDEV=',2X,E12.5)
  142 TYPE *,' *** FOR ORIGINAL DATA ***'
      TYPE *,'CHECK FOR OUTLIERS BY WRC METHOD'
C     THE RANGE OF A DATA
      XMIN=1000000000.0
      XMAX=0.00000001
      IOUTHI=0
      IOUTLO=0
      NN=N-9
      DO I=1,N
        IF(X(I).LT.XMIN)XMIN=X(I)
      ENDDO
      DO I=1,N
        IF(X(I).GT.XMAX)XMAX=X(I)
      ENDDO
      WRITE(*,*)
      WRITE(*,*)' THE PARAMETERS FOR THE LOG-TRANSFORMED DATA ARE '
      WRITE(*,190)AMEAN,SKEW,STDEV
  190 FORMAT(2X,'MEAN=',2X,F10.4,/,2X,'SKEW=',2X,E12.5,/,2X,
     1'STDEV=',2X,E12.5,/)
      WRITE(*,200)XMIN,XMAX
  200 FORMAT(2X,'XMIN=',E12.5,5X,'XMAX=',E12.5)
      IF(IOU.EQ.1.OR.IOU.EQ.6.OR.IOU.EQ.7)THEN
        AM=AME
        ST=STD
      ELSE
        AM=AMEAN
        ST=STDEV
      ENDIF
      YH=AM+KO(NN)*ST
      YL=AM-KO(NN)*ST
      IF(IOU.EQ.2.OR.IOU.EQ.3)THEN
        YH1=EXP(YH)
        YL1=EXP(YL)
        WRITE(*,*)' FOR LOG-NORMAL DISTRIBUTION '
        WRITE(*,*)' THE THRESHOLD VALUE FOR HIGHER OUTLIERS ',YH1
        WRITE(*,*)' THE THRESHOLD VALUE FOR LOWER OUTLIERS ',YL1
        XMIN=EXP(XMIN)



        XMAX=EXP(XMAX)
      ELSE
        WRITE(*,*)' FOR NORMAL DISTRIBUTION '
        WRITE(*,*)' THE THRESHOLD VALUE FOR HIGHER OUTLIERS ',YH
        WRITE(*,*)' THE THRESHOLD VALUE FOR LOWER OUTLIERS ',YL
      ENDIF
      DO 3 I=1,N
        IF(X(I).GE.YH)IOUTHI=IOUTHI+1
        IF(X(I).LE.YL)IOUTLO=IOUTLO+1
    3 CONTINUE
      WRITE(*,100)XMIN,XMAX,IOUTHI,IOUTLO
  100 FORMAT(1X,'XMIN=',E12.5,2X,'XMAX=',E12.3,/,1X,
     1'NO OF HIGH OUTLIERS =',I5,/,1X,'NO OF LOW OUTLIERS= ',I5)
      IF(IOU.EQ.6.OR.IOU.EQ.7)GOTO 501
      IF(IOU.EQ.2.OR.IOU.EQ.3)THEN
        DO I=1,N
          IF(IOU.EQ.3)X(I)=X(I)+M3
          X(I)=EXP(X(I))
        ENDDO
      ENDIF
      IF(IOU.EQ.1)THEN
        CALL PAEST(IOU,N,X,IOUTHI,IOUTLO,1)
      ELSE
        CALL PAEST(IOU,N,X,IOUTHI,IOUTLO,1)
      ENDIF
      GOTO 500
C     DATA IS INITIALLY TRANSFORMED FROM EV-I TYPE TO EXPONENTIAL TYPE
C     AND THEN TESTED FOR OUTLIERS
  115 E1SUM=0.0
      B1=STD*SQRT(6.0)/3.1415
      A1=AME-0.5772*B1
      WRITE(*,*)' A= ',A1,' B= ',B1
      DO I=1,N
        EY1(I)=0.0
        EY1(I)=EY1(I)+EXP(-(X(I)/B1))
        E1SUM=E1SUM+EY1(I)
      ENDDO
      WRITE(*,*)' E1SUM = ',E1SUM
      NXX=N-1
      DO 61 K=1,NXX
      DO 61 I=K,NXX
        IF(EY1(K).GT.EY1(I+1)) GO TO 61
        TEMP=EY1(K)
        EY1(K)=EY1(I+1)
        EY1(I+1)=TEMP
   61 CONTINUE
      ALAMB1=EXP(-(A1/B1))
      ALAMB2=EXP(A1/B1)
      WRITE(*,*)' FOR THE EXPONENTIAL DISTRIBUTION '
      WRITE(*,*)' ORIGIN = 0 '
      WRITE(*,*)' SCALE PARAMETER FOR GREATEST VALUE DIST = ',ALAMB1
      WRITE(*,*)' SCALE PARAMETER FOR SMALLEST VALUE DIST = ',ALAMB2
      GOTO 510
  120 E3SUM=0.0
      B3=STDEV*SQRT(6.0)/3.1415
      A3=AMEAN-0.5772*B3
      WRITE(*,*)' A= ',A3,' B= ',B3




      DO I=1,N
        EY3(I)=0.0
        EY3(I)=EY3(I)+EXP(-(X(I)/B3))
        E3SUM=E3SUM+EY3(I)
      ENDDO
      WRITE(*,*)' E3SUM = ',E3SUM
      NXX=N-1
      DO 62 K=1,NXX
      DO 62 I=K,NXX
        IF(EY3(K).GT.EY3(I+1)) GO TO 62
        TEMP=EY3(K)
        EY3(K)=EY3(I+1)
        EY3(I+1)=TEMP
   62 CONTINUE
      ALAMB3=EXP(-A3/B3)
      ALAMB4=EXP(A3/B3)
      WRITE(*,*)' FOR THE EXPONENTIAL DISTRIBUTION '
      WRITE(*,*)' ORIGIN = 0 '
      WRITE(*,*)' SCALE PARAMETER FOR GREATEST VALUE DIST = ',ALAMB3
      WRITE(*,*)' SCALE PARAMETER FOR SMALLEST VALUE DIST = ',ALAMB4
      GOTO 510
C     TEST FOR OUTLIERS IN THE EXPONENTIAL DISTRIBUTION
C     CONSECUTIVE TESTS FOR A SINGLE UPPER OUTLIER IN EXPONENTIAL DIST
C     AN UPPER OUTLIER IN THE EXT VALUE DIST IS TRANSFORMED AS
C     LOWER OUTLIER IN THE EXPONENTIAL DISTRIBUTION
  510 N1=N/2
      N11=N
      N2=N
      IOUTHI=0
      IOUTLO=0
      IF(IOU.EQ.4)THEN
        SSH=E1SUM
        DO I=1,N1
          TEST(I)=EY1(I)/SSH
          IF(N11.GT.120)THEN
            SPH1=N11*1.0/((1.0+(TEST(I)/(1.0-TEST(I))))**(N11-1))
          ENDIF
          IF(N11.GT.60.AND.N11.LE.120)THEN
            SPH1=0.1131-((0.1131-0.0632)/60.0)*(N11-60)
          ENDIF
          N12=N11-1
          IF(N12.GT.59)THEN
            SPH(N12)=0.0
          ELSE
            SPH1=0.0
          ENDIF
          IF(SPH(N12).LT.TEST(I).AND.SPH1.EQ.0.OR.
     1       SPH(N12).EQ.0.AND.SPH1.LT.TEST(I))THEN
            IOUTLO=IOUTLO+1
            SSH=SSH-EY1(I)
            N11=N11-1
            GOTO 151
          ELSE
            GOTO 159
          ENDIF
  151   END DO
      ELSE
        SSH=E3SUM
        DO I=1,N1
          TEST(I)=EY3(I)/SSH
          IF(N11.GT.120)THEN
            SPH1=N11*1.0/((1.0+(TEST(I)/(1.0-TEST(I))))**(N11-1))
          ENDIF
          IF(N11.GT.60.AND.N11.LE.120)THEN
            SPH1=0.1371-((0.1371-0.0759)/60.0)*(N11-60)
          ENDIF
          N12=N11-1
          IF(N12.GT.59)THEN
            SPH(N12)=0.0
          ELSE
            SPH1=0.0
          ENDIF
          IF(SPH(N12).LT.TEST(I).AND.SPH1.EQ.0.OR.
     1       SPH(N12).EQ.0.AND.SPH1.LT.TEST(I))THEN
            IOUTLO=IOUTLO+1
            SSH=SSH-EY3(I)
            N11=N11-1
            GOTO 153
          ELSE
            GOTO 159
          ENDIF
  153   ENDDO
      ENDIF
  159 WRITE(*,*)' NO OF LOW OUTLIERS IN THE EV DIST = ',IOUTLO
      WRITE(*,*)
      GOTO 520
C     CONSECUTIVE TESTS FOR A SINGLE LOWER OUTLIER IN EXPONENTIAL DIST
C     A LOWER OUTLIER IN THE EXT VALUE DIST IS TRANSFORMED AS
C     UPPER OUTLIER IN THE EXPONENTIAL DISTRIBUTION
  520 N11=N
      IF(IOU.EQ.4)THEN
        SS=E1SUM
        DO I=N2,N1,-1
          TEST(I)=EY1(I)/SS
          IF(N11.GT.100)THEN
            SPL1=N11*1.0/((1.0+(TEST(I)/(1.0-TEST(I))))**(N11-1))
          ENDIF
          IF(N11.GT.20.AND.N11.LE.30)THEN
            SPL1=0.000135-((0.000135-0.0000589)/10)*(N11-20)
          ENDIF
          IF(N11.GT.30.AND.N11.LE.40)THEN
            SPL1=0.0000589-((0.0000589-0.0000329)/10)*(N11-30)
          ENDIF
          IF(N11.GT.40.AND.N11.LE.50)THEN
            SPL1=0.0000329-((0.0000329-0.0000209)/10)*(N11-40)
          ENDIF
          IF(N11.GT.50.AND.N11.LE.100)THEN
            SPL1=0.0000209-((0.0000209-0.00000518)/50)*(N11-50)
          ENDIF
          N12=N11-2
          IF(N12.GT.18)THEN
            SPL(N12)=0.0
          ELSE
            SPL1=0.0




          ENDIF
          IF(SPL(N12).GT.TEST(I).AND.SPL1.EQ.0.OR.
     1       SPL(N12).EQ.0.AND.SPL1.GT.TEST(I))THEN
            IOUTHI=IOUTHI+1
            SS=SS-EY1(I)
            IF(I.EQ.1)GOTO 150
            N11=N11-1
            GOTO 152
          ELSE
            GOTO 150
          ENDIF
  152   END DO
      ELSE
        SS=E3SUM
        DO I=N2,N1,-1
          TEST(I)=EY3(I)/SS
          IF(N11.GT.100)THEN
            SPL1=N11*1.0/((1.0+(TEST(I)/(1.0-TEST(I))))**(N11-1))
          ENDIF
          IF(N11.GT.20.AND.N11.LE.30)THEN
            SPL1=0.0000264-((0.0000264-0.0000116)/10)*(N11-20)
          ENDIF
          IF(N11.GT.30.AND.N11.LE.40)THEN
            SPL1=0.0000116-((0.0000116-0.00000644)/10)*(N11-30)
          ENDIF
          IF(N11.GT.40.AND.N11.LE.50)THEN
            SPL1=0.00000644-((0.00000644-0.00000410)/10)*(N11-40)
          ENDIF
          IF(N11.GT.50.AND.N11.LE.100)THEN
            SPL1=0.0000041-((0.0000041-0.00000102)/50)*(N11-50)
          ENDIF
          N12=N11-2
          IF(N12.GT.18)THEN
            SPL(N12)=0.0
          ELSE
            SPL1=0.0
          ENDIF
          IF(SPL(N12).GT.TEST(I).AND.SPL1.EQ.0.OR.
     1       SPL(N12).EQ.0.AND.SPL1.GT.TEST(I))THEN
            IOUTHI=IOUTHI+1
            SS=SS-EY3(I)
            IF(I.EQ.1)GOTO 150
            N11=N11-1
            GOTO 154
          ELSE
            GOTO 150
          ENDIF
  154   END DO
      ENDIF
  150 WRITE(*,*)' NO OF HIGH OUTLIERS IN THE EV DIST = ',IOUTHI
      IF(IOU.EQ.4)THEN
        CALL PAEST(IOU,N,X,IOUTHI,IOUTLO,1)
      ELSE
        DO I=1,N
          X(I)=EXP(X(I))
        ENDDO
        CALL PAEST(IOU,N,X,IOUTHI,IOUTLO,1)





      ENDIF
      GOTO 500
C     TRANSFORM P-III DIST TO NORMAL DIST AND THEN CHECK FOR OUTLIERS
  130 CALL PARAM(X,N,AME,VARI,SKW,KUT,STD,2)
      WRITE(*,*)
      WRITE(*,*)' THE PARAMETERS FOR THE ORIGINAL DATA ARE '
      WRITE(*,191)AME,SKW,STD
  191 FORMAT(2X,'MEAN=',2X,F10.4,/,2X,'SKEW=',2X,E12.5,/,
     1 2X,'STDEV=',2X,E12.5)
      WRITE(*,*)
  131 DO I=1,N
        TX(I)=X(I)
      ENDDO
      IF(IOU.EQ.6)THEN
        CALL IPT(N,1,AME,STD,SKW,X)
        CALL PARAM(X,N,AME,VARI,SKW,KUT,STD,2)
        GOTO 142
      ELSE
        CALL IPT(N,1,AMEAN,STDEV,SKEW,X)
        CALL PARAM(X,N,AME,VARI,SKW,KUT,STD,2)
        GOTO 142
      ENDIF
  501 IF(IOU.EQ.7)THEN
        DO I=1,N
          TX(I)=EXP(TX(I))
        ENDDO
      ENDIF
      IF(IOU.EQ.6)THEN
        CALL PAEST(IOU,N,TX,IOUTHI,IOUTLO,1)
      ELSE
        CALL PAEST(IOU,N,TX,IOUTHI,IOUTLO,1)
      ENDIF
  500 RETURN
      END
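For the normal and log-normal cases, OUTLIER sets a high and a low threshold at the mean plus or minus KO(N-9) standard deviations of the log-transformed data and counts the values at or beyond each threshold. A minimal free-form sketch of that step follows. It is an illustration only: the program name wrc_outlier_demo and the ten sample values are made up, the constant 2.036 is the first entry of the KO table (N = 10), and the standard deviation uses an n-1 divisor, which is an assumption because the PARAM routine is not shown in this section.

```fortran
program wrc_outlier_demo
  implicit none
  integer, parameter :: n = 10
  real, parameter :: kn = 2.036   ! KO(1) in the listing, used when N = 10
  real :: x(n), y(n), ymean, ystd, yh, yl
  ! made-up annual peak values; one is deliberately extreme
  x = (/ 120.0, 95.0, 150.0, 80.0, 110.0, &
         130.0, 105.0, 90.0, 1400.0, 100.0 /)
  y = log(x)
  ymean = sum(y) / real(n)
  ! sample standard deviation with an n-1 divisor (assumed)
  ystd = sqrt(sum((y - ymean)**2) / real(n - 1))
  yh = ymean + kn*ystd
  yl = ymean - kn*ystd
  print '(a,f10.2)', ' high threshold = ', exp(yh)
  print '(a,f10.2)', ' low threshold  = ', exp(yl)
  print '(a,i3)', ' high outliers  = ', count(y >= yh)
  print '(a,i3)', ' low outliers   = ', count(y <= yl)
end program wrc_outlier_demo
```

The thresholds are printed after back-transformation with EXP, in the same way the listing reports YH1 and YL1 for the log-normal options.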
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